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Major  Department:  Statistics 

In  cancer  research,  some  of  the  patients  are  disease-free  after  certain 
treatments.  It  is  interesting  to  compare  treatment  efficacy  with  long-term  survival 
rates.  One  commonly  used  approach  in  analyzing  this  type  of  data  is  to  compare 
the  Kaplan-Meier  estimates  of  the  cure  rate.  Another  approach  is  to  apply  the 
mixture  model  proposed  by  Farewell.  However,  the  Kaplan-Meier  estimates  are 
unstable  toward  the  end  point,  while  the  mixture  model  is  computationally  too 
complex. 

To  overcome  these  difficulties,  we  propose  a test  based  on  a piecewise 
Gompertz  model  to  compare  drug  efficacy  on  the  cure  rate.  The  proposed  test  also 
accommodates  the  situation  where  patients  display  different  hazard  patterns  during 
different  treatment  stages.  In  this  work,  we  have  derived  the  strict  concavity  of  the 
log-likelihood  function  and  the  existence,  consistency,  and  asymptotic  normality  of 
the  maximum  likelihood  estimates  of  the  parameters.  In  addition,  our  Monte  Carlo 
simulation  study  shows  that  the  proposed  test  is  more  computationally  feasible 
and  more  powerful  than  the  test  based  on  Farewell’s  mixture  model.  Moreover,  an 
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example  is  given  to  show  the  utility  of  our  proposed  test.  A goodness-of-fit  test  of 
our  proposed  model  is  also  discussed. 


CHAPTER  1 
INTRODUCTION 

1.1  Problem  and  Common  Practice 

In  survival  analysis,  one  is  interested  in  comparing  the  efficiency  of  various 
drugs  or  treatments.  The  survival  probability  is  generally  assumed  to  converge  to 
zero  when  the  observation  time  becomes  large.  However,  it  is  not  uncommon  in 
cancer  research  that  a proportion  of  patients  are  disease-free  (relapse-free)  after 
certain  treatments.  These  patients  are  called  “long-term  survivors”  or  “cured.” 

For  instance,  in  clinical  trials  dealing  with  childhood  leukemia,  some  treatments 
do  cure  a proportion  of  patients  permanently.  This  type  of  long-term  survival 
pattern  is  different  from  what  is  dealt  with  under  the  traditional  survival  analysis. 
In  addition,  it  is  sometimes  necessary  to  assume  that  hazard  functions  are  different 
on  different  intervals.  A piecewise  exponential  model  proposed  by  Friedman  (1981) 
accommodates  this  assumption.  This  model  was  considered  semiparametric  by 
Aitkin  et  al.  (1989,  PP.  287-292).  They  stated  that  the  piecewise  exponential 
model  has  the  same  flexibility  as  the  proportional  hazards  model  in  which  the 
baseline  hazard  function  is  unspecified.  In  this  chapter,  we  first  introduce  the 
common  methodologies  in  the  literature  that  deal  with  cure  rate  problems.  We 
also  discuss  the  piecewise  exponential  model  proposed  by  Friedman  which  assumes 
different  hazard  functions  on  different  time  intervals. 

In  the  literature,  there  are  several  methods  dealing  with  the  long-term 
survival  problem.  They  are  as  follows: 

• Using  a mixture  model  which  assumes  the  underlying  population  is  a mixture  of 
two  populations,  patients  who  are  cured  and  patients  who  are  not  cured. 
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• Applying  a modified  binary  regression  model  in  which  the  main  interest  is  to 
compare  the  cure  rates  among  patient  groups. 
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• Using  nonparametric  methods  to  compare  the  survival  curves  or  to  compare  cure 
proportions  among  treatment  arms. 

• Assuming  a bounded  cumulative  hazard  rate  instead  of  a cumulative  hazard  rate 
with  an  infinite  limit. 

In  the  next  four  sections,  we  describe  each  method  in  more  detail.  Assume 
that  there  are  n patients  in  the  study.  For  the  patient,  let  Ti  denote  the  time  to 
relapse  (incidence),  Ui  the  censoring  time,  T*  = min(rj,  Ui),  Aj  = I[Ti<Ui],  and  x,  a 
p-dimensional  covariates  vector  associated  with  the  patient.  The  observed  data  are 
{t*l,  ^1,  Xi),  52,  X2),  . . . , (t;,  5n,  x„). 

1.1.1  Mixture  Model 

Let  Z be  a binary  variable  showing  that  a patient  belongs  to  either  the 
cured  population  or  the  noncured  population. 


Note  that  Z is  not  fully  observable  in  the  presence  of  censoring.  Clearly, 
if  the  incidence  is  observed,  then  A = 1 and  Z — 1;  if  the  time  to  incidence  is 
censored,  then  A = 0 and  Z is  not  observable.  Assume  that  the  probability  density 


Let  S*{t  I Z = 1)  be  the  corresponding  survival  function.  If  Si  = 1,  then  the 
contribution  of  the  patient  to  the  likelihood  is 


1,  if  the  patient  is  noncured; 


0,  if  the  patient  is  cured. 


function  for  the  time  to  incidence  is  f*{t  \ Z = 1)  for  the  noncured  population. 


nu  I z,  = i)p(z,  = 1). 


If  Si  = 0,  then  the  contribution  of  the  patient  to  the  likelihood  is 


P{Zi  = 0)  + P{Z,  = l)S*{U\Zi  = l). 


(1.1) 
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Therefore,  the  overall  likelihood  function  is 

n 

L = Y[{P{Zi  = l)r{t*  I Z,  = 1)}'^P(Z,  = 0)  + P{Zi  = I Z,  = 

(1.2) 

Two  influential  articles  addressing  this  method  were  written  by  Farewell 
in  1977  and  1982.  In  the  first  article,  he  expressed  the  binary  variable  Z using  a 
logistic  regression  model 

where  /3  is  a vector  of  coefficients  associated  with  the  vector  of  covariates  x. 
Farewell  (1977)  also  assumed  the  time  to  incidence  T follows  an  exponential 
distribution  with  mean  1/A,  and  A is  assumed  to  be  known.  Thus,  when  = 1,  the 
contribution  of  the  patient  to  the  likelihood  is 

g)3^Xi 

f*{Ti  = tf,\\Zi  = l)P{Zi  = 1)  = Ae-^*^- ti  > 0. 

1 _|_  gA»  Xj 


When  5i  = 0,  the  contribution  of  the  patient  to  the  likelihood  is 


P{Zi  = 0)  + P{Zi  = l)S*{Ti  = ti,\\Zi  = 1) 


1 


d/3  Xi 


+ 


1 + xi  1 + 


U > 0. 


Type  I censoring  was  assumed  in  Farewell  (1977).  The  author  also  accessed  the 
efficiency  of  the  logistic-exponential  mixture  model  proposed  in  his  paper  relative 
to  a regular  logistic  model,  which  assumes  Z = 0 if  the  patient  does  not  have  a 
relapse  until  T°.  He  defined  as  the  end  point  of  the  observation  time.  He  found 
that  the  efficiency  of  the  mixture  model  increases  as  the  follow-up  time  T°  increases 
and  as  P(Z  = 1)  decreases.  When  the  true  underlying  distribution  is  a mixture 
model,  the  estimate  of  the  cure  rate  from  the  regular  logistic  model  is  biased. 

Farewell  (1982)  proposed  two  more  general  models  for  the  time  to  incidence 
T.  He  assumed  that  T follows  a Weibull  distribution  in  his  first  model.  The 
probability  density  function  and  the  survival  function  of  T given  that  the  patient  is 
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noncured  are  written  as 

f*{T  = t]  X,(  \ Z = l,x)  = exp{— (At)^},  t > 0,  and  (1.4) 

S'*(T  = t I Z = 1,  x)  = exp{— (At)'’},  t > 0,  (1.5) 

where  C is  an  unknown  parameter,  A = exp(— 7^x)  is  a function  of  covariates,  and 
7 is  a vector  of  unknown  parameters  associated  with  covariates  x.  The  likelihood 
function  is  obtained  by  plugging  in  P{Z  = 1)  defined  in  Equation  (1.3),  the 
probability  density  function  defined  in  Equation  (1.4),  and  the  survival  function 
defined  in  Equation  (1.5)  to  the  likelihood  function  in  Equation  (1.2). 

Farewell  (1982)  proposed  the  second  model  for  grouped  data  on  survival 
time.  Assume  there  are  k failure  time  groups  {ti,  tj+i],  i = 1,2, ...  ,k,  the 
conditional  probability  for  T = ti  given  that  T > U is 

^Oi+O'^x 

P(T  = ti  \ Z = 1,T  > ti,x)  = 5;-,  (1.6) 

where  a.i  (z  = 1,  2, . . . , A;)  are  group-specific  parameters.  The  vector  0 is  associated 
with  covariates  x and  is  common  to  all  time  groups.  By  combining  Equations  (1.2), 
(1.3),  and  (1.6),  the  likelihood  function  is  obtained. 

Farewell  (1982)  commented  that  the  mixture  model  is  appealing  from  both 
biological  and  statistical  points  of  view.  It  has  been  used  frequently  in  cancer 
studies,  especially  in  the  study  of  breast  cancer.  Strong  evidence  of  the  existence  of 
individuals  coming  from  different  populations,  however,  is  required  before  using  the 
mixture  model.  Farewell  also  pointed  out  that  the  likelihood  function  might  be  fiat 
for  some  data  sets,  which  would  cause  possible  nonidentifiability  of  the  parameters. 

Farewell  (1986)  discussed  some  problems  related  to  the  mixture  model  by 
applying  the  model  to  two  data  sets.  The  first  data  set  was  from  a breast  cancer 
trial  with  139  patients.  He  applied  a logistic- Weibull  mixture  model  to  the  data. 
Then  he  computed  the  profile  likelihood  function  for  (3q,  the  intercept  parameter 
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in  the  logistic  portion  of  the  mixture  model.  The  profile  likelihood  function  for  Pq 
was  obtained  by  replacing  the  other  parameters  in  the  model  by  their  maximum 
likelihood  estimates  (MLE).  He  found  that  the  profile  likelihood  function  was  in  a 
strongly  nonquadratic  shape,  indicating  that  significant  nonidentifiability  existed 
with  respect  to  the  estimation  of  /3q.  Therefore,  this  logistic-Weibull  mixture  model 
presented  an  unreliable  estimate  of  the  cure  proportion.  Farewell  also  applied  a 
multinomial-weibull  mixture  model  to  a data  set  from  Risch  (1983)  on  the  time 
to  the  onset  of  a major  affective  illness  in  siblings  of  affected  individuals.  Risch’s 
data  were  grouped  into  six  time  intervals.  The  onset  probabilities  were  modelled 
by  a multinomial  distribution.  The  mixture  model  fitted  data  well.  Therefore, 
Farewell  concluded  that  one  should  be  cautious  when  using  mixture  models.  The 
cure  proportion  might  be  very  imprecise  in  some  cases  and  strong  evidence  for  the 
existence  of  two  populations  is  required  to  make  a reliable  inference. 

A significant  amount  of  literature  discussed  the  mixture  model.  The 
common  parametric  distributions  for  the  time  to  incidence  for  the  noncured 
population  can  take  many  forms.  For  example,  they  can  be  exponential  (Berkson 
& Gage  1952,  Bithell  & Upton  1977,  Farewell  1977,  Goldman  1984);  lognormal 
(Boag  1949);  Weibull  (Farewell  1982);  Gompertz  (Gordon  1990);  Generalized 
Gamma  (Yamaguchi  1992);  and  Generalized  F-distribution  (Peng  et  al.  1998). 

The  distribution  for  the  time  to  incidence  can  also  take  nonparametric  or 
semiparametric  forms,  such  as  Kaplan- Meier  (KM)  estimates  (Laska  & Meisner 
1992,  Taylor  1995)  and  the  proportional  hazards  model  (Kuk  & Chen  1992).  The 
probability  P{Z  = 1)  can  be  treated  as  a parameter  in  a Bernoulli  trial.  It  can  also 
be  written  as  a function  of  a vector  of  covariates  x in  a logistic  regression  model, 
such  as  the  model  defined  in  Equation  (1.3). 

To  conclude  the  previous  discussion,  it  is  generally  plausible  to  use  the 
mixture  model  when  there  exists  strong  scientific  evidence  of  the  mixture  of 
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different  populations.  However,  as  both  Farewell  (1977,  1982,  & 1986)  and  Cantor 
and  Shuster  (1992)  pointed  out,  the  likelihood  function  is  sometimes  nonquadratic 
or  flat,  which  could  produce  problems  for  both  computation  and  inference.  In 
addition,  parameter  estimates  could  fall  out  of  the  parameter  space  when  the  data 
do  not  support  a nonzero  surviving  fraction.  The  asymptotic  properties  of  the 
MLE  from  a logistic-parametric  mixture  model  were  developed  systematically  by 
Mailer  and  Zhou  (1996).  The  asymptotic  properties  related  to  the  model  parameter 
estimates  in  both  logistic-semiparametric  and  logistic-nonparametric  models  have 
not  been  explored  in  the  literature. 


1.1.2  Binary  Regression  Model 

Jung  (1996)  proposed  an  interesting  method  to  estimate  the  cure  rate, 
which  is  defined  to  be  the  probability  that  the  time  to  incidence  T is  greater  than 
a fixed  time  r.  In  his  paper,  the  standard  procedure  for  inference  on  parameters 
in  a binary  regression  model  is  modified  to  accommodate  censored  observations. 
Let  0(7t)  = /3^x,  where  0 is  a link  function  which  is  monotonic  and  differentiable 
on  [0, 1].  Also  let  tt*  = the  inverse  function  of  (j).  Let  tt*  = 1 — tt*  and 
7T^(^)  = dTr*{4>)/d(f).  The  MLE  of  in  a standard  binary  regression  model  without 
censoring  is  obtained  by  solving 


n 


-1/2 


^{/(ti  > r)  -7T*(/3^Xi)}- 


7r;(/3'^Xi) 


0. 


7i*(/3  Xj)7r*(/3  Xj) 

In  the  case  with  censoring,  denote  T*  = min(Ti,  Ui).  Assume  independent 
censoring,  i.e.,  Ti  and  Ui  are  independent  given  Xj  for  i = 1,  2, . . . , n.  Also 
assume  that  Ui,U2,  ■ ■ . ,Un  are  independently,  identically  distributed  (iid)  with  a 
cumulative  distribution  function  1 — G.  Let  G be  the  KM  estimate  of  G.  Then  the 
MLE  Pmle  be  obtained  by  solving 


n 


-1/2 


E( 

1=1 


m > r) 


- 7T*(/3^Xi)} 


7r*(/3^Xi)7T*(/3'^Xi) 


-X,:  = 0. 
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Numerically  0mle  can  be  obtained  by  applying  the  Newton-Raphson  method. 
Under  mild  conditions,  it  is  proved  that  ^mle  is  a unique  and  a consistent 
estimate  of  /3  and  n^^‘^{0MLE  ~ P)  follows  an  asymptotic  normal  distribution  with  a 
zero  mean.  The  problem  with  this  method  is  that  r is  very  difficult  to  define.  The 
result  is  not  convincing  if  r is  too  small.  On  the  other  hand,  it  is  not  cost  effective 
for  r to  be  too  long. 

Farewell  (1977)  compared  the  efficiency  of  a logistic-exponential  model 
to  a binary  logistic  regression  model  that  treats  any  individual  with  t*  > r as  a 
long-term  survivor.  He  also  pointed  out  that  the  estimate  of  the  cure  rate  would 
be  biased  using  a logistic  model  when  the  underlying  true  distribution  is  a mixture 
distribution.  The  logistic  model  discussed  by  Farewell  (1977)  did  not  consider 
censoring.  However,  he  emphasized  that  a similar  conclusion  can  be  reached  by 
expanding  the  model  to  include  censored  individuals.  Farewell  also  noted  that  it  is 
especially  unreasonable  for  the  logistic  model  to  ignore  the  possible  censoring  times 
when  T*  exceeds  the  largest  observing  time  r. 

1.1.3  Nonparametric  Method 

Nonparametric  tests  are  widely  applied  for  both  comparing  survival  curves 
with  the  presence  of  cure  and  comparing  cure  proportions  between  treatment  arms. 
When  the  main  interest  is  to  compare  survival  curves  of  several  populations  with 
the  presence  of  cure,  some  of  the  nonparametric  tests  are  considered  standard,  such 
as  the  log-rank  test  and  the  generalized  Wilcoxon  test.  Halpern  and  Brown  (1987) 
studied  the  power  of  these  two  tests  by  simulations  when  the  mixture  cure  rate 
model  by  Goldman  (1984)  is  appropriate.  They  compared  the  power  of  these  two 
tests  for  detecting  the  difference  between  two  treatment  arms  for  selected  values 
of  cure  rates,  instant  failure  rates,  and  follow-up  time.  Each  of  the  two  treatment 
arms  had  50  participating  individuals.  They  used  200  replications  for  each  set  of 
simulation.  The  simulation  results  are  somewhat  inconclusive.  The  log-rank  test  is 
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more  powerful  than  the  Wilcoxon  test  in  general.  However,  the  log-rank  test  does 
not  show  any  advantage  over  the  Wilcoxon  test  in  the  following  situations:  1)  cure 
rates  are  high  and  are  at  similar  levels  in  both  arms;  2)  there  is  a large  difference  in 
the  time  to  the  end  point  for  both  arms;  and  3)  the  follow-up  is  long  relative  to  the 
time  to  failure. 

In  the  comparison  of  cure  proportions,  Sposto  et  al.  (1992)  studied  several 
tests  based  on  simulated  samples.  These  tests  are  the  likelihood  ratio,  the  Wald, 
the  score  tests  based  on  two  different  mixture  models,  and  a test  based  on  the 
differences  of  the  KM  estimates  of  the  cure  rates.  The  two  mixture  models  are 
logistic-exponential  and  logistic- Weibull  models  proposed  by  Farewell  (1977  & 

1982).  The  time  point  in  the  KM  estimate  of  the  cure  rate  is  defined  as  the  last 
observed  failure  time.  Simulated  samples  under  assumed  mixture  models  are 
generated  in  order  to  check  the  results  in  an  ideal  case.  In  addition,  samples 
under  some  different  models  are  generated  in  order  to  study  the  robustness  of 
the  tests  to  model  misspecification.  The  authors  also  generated  samples  based 
on  various  levels  of  cure  proportions.  The  test  statistics  calculated  from  the 
mixture  models  sometimes  can  not  be  obtained  because  of  the  nonconvergence 
during  the  maximization  procedure.  The  authors  pointed  out  two  reasons  for  the 
nonconvergence:  1)  the  maximization  is  obtained  on  the  boundary;  and  2)  the 
algorithm  terminates  abnormally  with  a jump  to  a nonnegative  definite  region. 

Test  statistics  with  convergence  problem  are  not  presented  in  the  article.  In 
conclusion,  the  authors  noted  that  the  KM  estimate  of  the  cure  rate  is  a reasonable 
alternative  to  the  parametric  estimate  in  general.  When  the  underlying  model 
includes  unequal  failure  rates  between  treatment  arms,  the  proportion  test  based  on 
KM  estimates  is  more  powerful  than  the  test  based  on  the  complicated  parametric 
model.  However,  the  KM  estimate  introduces  more  bias  than  the  parametric 
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estimate  of  the  cure  rate  when  the  sample  is  small,  when  failure  rates  are  equal, 
and  when  not  all  failures  are  observed. 

Gray  and  Tsiatis  (1989)  proposed  a linear  rank  test  to  compare  survival 
curves  between  two  treatment  arms  over  the  entire  time  span,  but  their  test 
focused  power  against  the  later  difference  on  survival  curves.  This  linear  rank 
test  is  evaluated  on  a mixture  distribution  of  a proportion  representing  the  cure 
and  a failure  distribution  with  an  unspecified  form.  They  assumed  that  failure 
distribution  functions  for  both  treatment  arms  are  constant  over  time.  They 
specified  a weight  function  in  a two-sample  non-parametric  censored  test  proposed 
by  Tarone  and  Ware  (1977)  in  order  to  obtain  their  own  test.  Note  that  the  log- 
rank  test  also  can  be  derived  from  Tarone  and  Ware’s  test  with  a different  weight 
function.  However,  the  difference  between  Gray  and  Tsiatis’  test  and  the  log-rank 
test  is  that  the  former  put  more  weight  towards  the  later  difference  and  the  latter 
put  equal  weight  throughout  the  entire  observation  period. 

Gary  and  Tsiatis  (1989)  calculated  relative  efficiencies  under  equal  censoring 
for  their  test,  the  log-rank  test,  and  the  proportion  test  based  on  KM  estimates  of 
cure  rates.  When  there  is  no  censoring,  their  test  gains  efficiency  over  the  log-rank 
test  for  a small  cure  rate.  However,  the  efficiency  gain  is  less  for  a cure  rate  greater 
than  50%.  These  results  can  be  explained  by  observing  that  the  weight  functions 
become  closer  for  their  test  and  for  the  log-rank  test  as  the  cure  rate  increases.  In 
addition,  under  the  noncensoring  assumption,  their  test  is  asymptotically  equivalent 
to  the  proportion  test.  When  there  is  censoring  and  when  the  censoring  proportion 
increases,  their  test  is  increasingly  more  powerful  than  the  proportion  test.  But 
note  that  their  test  is  not  a pure  test  of  the  equality  of  the  cure  rates.  Therefore,  it 
has  power  against  alternatives  with  unequal  cure  rates  and  alternatives  with  equal 
cure  rates  and  unequal  survival  curves. 
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In  this  section,  two  types  of  statistical  tests  are  discussed:  1)  comparing 
survival  curves  under  the  assumption  of  the  existence  of  cure  rates;  and  2) 
comparing  cure  rates.  Some  of  the  nonparametric  tests  such  as  the  log-rank  test 
is  quite  standard  in  oncology  for  comparing  survival  curves.  When  the  problem 
narrows  down  to  the  comparison  of  cure  rates,  the  proportion  test  based  on  KM 
estimates  are  quite  popular  due  to  the  computation  difficulties  related  to  the  test 
based  on  the  mixture  model  structure.  However,  there  are  problems  associated 
with  KM  estimates  for  cure  rates,  as  pointed  out  by  Cantor  and  Shuster  (1992). 

The  standard  error  of  the  KM  estimate  increases  in  the  right-hand  extremes 
because  of  the  small  sample  size  at  the  end  of  the  study.  The  KM  estimate 
sometimes  can  produce  counterintuitive  results.  For  example,  the  time-to-relapse 
curve  plateau  could  be  higher  than  the  time-to-dealth  curve  plateau  because  of  the 
large  downward  steps  at  the  end  of  the  study.  Therefore,  one  should  be  cautious 
relying  on  the  KM  estimate  of  the  cure  rate.  As  an  alternative  to  the  KM  estimate 
of  the  cure  rate.  Cantor  and  Shuster  proposed  a parametric  model  with  a bounded 
cumulative  hazard  rate  to  accommodate  the  cure  proportion. 

1.1.4  Model  with  Bounded  Cumulative  Hazard  Rate 
Parametric  Model 

Cantor  and  Shuster  (1992)  proposed  a Gompertz  model  to  incorporate 
long-term  survivors.  The  survival  function  is  written  as: 

S'(t)  = exp[^~^a(l  — e^*)],  a > 0,  (1.7) 

where  a is  the  initial  hazard.  When  /3  > 0,  the  survival  function  S(t)  —)■  0 as 
t — >■  oo;  and  when  ^ < 0,  the  survival  function  S(t)  exp(a//3)  as  t > oo.  The 
cure  rate  is  presented  as  exp(a//d).  The  authors  then  maximized  the  likelihood 
function  based  on  the  Gompertz  model  using  the  Newton-Raphson  method. 

Both  Var(o;)  and  Var(/5)  are  obtained  from  the  observed  information  matrix 
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and  the  variance  of  the  cure  rate  a/^  is  obtained  using  the  Delta  method.  In 
the  article,  they  also  compared  the  Gompertz  model  with  the  mixture  model. 

When  there  is  a cure  proportion,  both  models  produce  similar  curves  and  cure 
rate  estimates.  However,  the  Gompertz  model  is  preferred  due  to  the  divergence 
problem  associated  with  the  mixture  model.  The  authors  also  noted  the  instability 
of  the  KM  estimate  toward  the  end.  Some  other  nice  properties  of  the  proposed 
Gompertz  model  are  also  provided  in  the  article.  For  example,  the  cure  probability 
given  the  patient  has  survived  through  time  to  is  exp(|e^*“);  the  log-likelihood 
function  is  strictly  concave  so  one  can  easily  estimate  the  parameters  and  the 
cure  rate;  and  finally,  S{t)^  is  also  a Gompertz  survival  function  when  S{t)  is  a 
Gompertz  survival  function. 

Gieser  et  al.  (1998)  incorporated  covariates  information  into  the  model 
proposed  by  Cantor  & Shuster  (1992).  The  treatment  specific  survival  function  for 
an  individual  is  written  as: 

5(*;i,x)  = {S.mK"  = exp{eT'’'-‘+“:/?-*(l  - e*')},  (1.8) 

where  a*  is  defined  as  logo;  for  a in  Equation  (1.7),  i is  the  treatment  index,  x is  a 
vector  of  covariates,  and  7 is  a vector  of  parameters  associated  with  x.  Note  that 
S{t]  z,x)  in  Equation  (1.8)  is  also  a Gompertz  survival  function  as  noted  by  Cantor 
and  Shuster  (1992).  The  cure  rate  in  this  case  is 

7t(z,x)  = 5(00;  z,x)  = exp{/3“^e'’'^’^''‘“‘*}. 

The  obtained  log-likelihood  function  is  concave  with  respect  to  parameters.  The 
MLEs  of  the  parameters  are  efficient  and  asymptotic  normal  when  the  assumed 
model  is  correct  and  when  the  sample  size  is  large.  The  authors  compared  the 
treatment  effects  on  cure  rates  for  any  given  covariates  vector  x.  The  ratio  of  the 
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log  of  the  cure  rates  for  two  different  treatment  arms  is  expressed  as 

^ ^ log7r(f,x)  ^ 

” log7r(f',x) 

The  null  hypothesis  on  testing  the  equality  of  cure  rates  is  written  as 

Hq  : log^../  = 0 

and  the  test  statistic  is  expressed  as 

Z =[di  + log(-/?./)  - log(-A)  - d^>]/SE, 

where  the  standard  error  in  the  denominator  is  obtained  by  the  Delta  method. 

Gieser  et  al.  (1998)  applied  their  model  to  763  patients  from  a multicenter 
prospective  study  preformed  in  1981  by  the  Pediatric  Oncology  Group.  This 
study  analyzed  the  standard  risk  of  acute  lymphocytic  leukemia  in  children.  They 
compared  two  treatment  effects  on  the  cure  rate.  Ghildren’s  age  and  sex  are 
considered  as  covariates.  The  Newton-Raphson  algorithm  is  used  to  yield  MLEs 
of  the  parameters.  The  algorithm  converges  after  nine  iterations.  Based  on  this 
study,  they  compared  the  estimates  and  the  variances  of  the  cure  rates  based  on 
the  Gompertz  model,  the  logistic-exponential  mixture  model,  and  the  KM  method. 
They  found  that  the  Gompertz  model  produces  similar  estimate  of  the  cure  rate  to 
the  KM  estimate  but  with  much  smaller  variance.  They  also  provided  a generalized 
model  in  which  the  treatment  indicator  is  included  as  one  of  the  covariates.  One 
important  advantage  of  this  generalized  model  is  the  log-linear  structure  of  the 
parameters,  since  the  log-linear  structure  implies  the  concavity  of  the  log-likelihood 
function  and  the  existence,  the  uniqueness,  and  the  asymptotic  normality  of  the 
parameter  estimates. 
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Semiparametric  Model 

Chen  et  al.  (1999)  proposed  a semiparametric  model  for  the  survival  data 
with  a cure  fraction.  They  assumed  that  there  are  two  variables:  a)  the  number  of 
carcinogenic  cells  N*  in  the  body  of  a patient;  and  b)  the  incubation  time  Wi  for 
the  carcinogenic  cell  (i  = 1, 2, . . . , A^*).  The  number  of  carcinogenic  cells  = 0 
implies  that  there  is  no  such  cell  in  the  patient’s  body.  Assume  that  N*  follows  a 
Poisson  distribution  with  mean  9 and  assume  that  ITj  for  z = 1, 2, . . . , A^*  are  iid 
with  distribution  function  F{w).  Variables  Wi  and  N*  are  independent.  Therefore, 
the  time  to  incidence  is  T = min(lT,,0  < i < N*).  Note  that  P{Wo  = oo)  = 1. 
Then  the  survival  function  for  T is  presented  as 


S{t) 


P(No  incidence  by  time  t) 

P{N*  = 0)  + P{T  >t,  N*  > 0) 


P{N*  = 0)  + P{Wi  >t,W2>t,...,WN,>t,N*>  0) 


.-e 


i=l 


e-eF{t)_ 


(1.9) 


Note  that  S{t)  is  an  improper  survival  function  because  S{t)  e~^  > 0 as  t — oo. 
The  survival  fraction  e~^  is  indeed  the  probability  that  there  exists  no  carcinogenic 
cell  in  a patient’s  body.  The  corresponding  probability  density  function  and  the 
hazard  function  are 


f{t)  = 9f{t)exp{-9F{t)), 

m = 0f{t). 

Let  S*{t)  denote  the  survival  function  for  the  noncured  proportion  of  patients,  then 

_ f,-6 

S*{t)  = P{T>t\  N*  >1)  = 


1 — 


(1.10) 
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The  corresponding  probability  density  function  is 

d 

/•(*)  = 

and  the  hazard  function  is 

f*ii)  1 

= P(r<oo|r>T<^>' 

The  likelihood  function  is  derived  as  follows 

n 

n 

i=\ 

= ni»/('ni''e“"'"”-  (I'll) 

i—\ 

The  model  is  semiparametric  in  the  sense  that  the  form  of  F{t)  is  unknown. 
It  has  a strong  biological  meaning  by  gathering  contributions  from  two  charac- 
teristics of  the  tumor  growth,  the  initial  numbers  of  carcinogenic  cells  and  the 
rate  of  their  growths.  Even  without  any  biological  meaning,  the  model  still  fits 
any  data  with  a survival  fraction.  It  can  always  be  treated  as  being  generated 
by  an  unknown  number  N*  of  latent  competing  risks.  In  this  model,  S'(t),  /(t), 
and  \{t)  are  improper,  while  S*{t),  and  \*{t)  are  proper  survival  function, 

density  function,  and  hazard  function,  respectively.  One  advantage  of  this  model 
over  the  mixture  model  is  that  the  overall  hazard  function  \{t)  has  a proportional 
hazard  structure  if  the  covariates  enter  the  model  through  9.  In  the  mixture  model, 
however,  the  overall  hazard  function  does  not  have  a proportional  hazard  structure 
if  the  covariates  enter  the  model  through  the  cure  rate  tt.  The  hazard  function 
h*{t)  for  the  failure  proportion  is  magnified  by  p(^x<oo\T>t)  ^ ^ compared  to  h{t). 

As  t goes  to  infinity,  function  h*{t)  converges  to  h{t).  In  addition,  the  authors 
pointed  out  the  close  relationship  between  the  model  presented  in  Equation  (1.9) 
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and  the  mixture  model  presented  in  Equation  (1.1)  by  observing  that  S{t)  defined 
in  Equation  (1.9)  can  also  be  written  as 


If  we  let  P{Zi  = 0)  in  Equation  (1.1)  take  the  value  of  and  let  S*{t  \ Zi  = 1) 
take  the  form  of  S*{t)  defined  in  Equation  (1.10),  then  S{t)  has  the  same  form  as 
in  Equation  (1.1).  Model  in  Equation  (1.1)  also  corresponds  to  some  model  of  the 
form  in  Equation  (1.9)  for  some  6 and  F{t).  Note  that  F{t)  is  assumed  to  have 
a Weibull  form  for  convenience  in  the  article.  The  parameter  ^ is  a function  of 
covariates  in  the  form  oi  9 = exp(/3^x). 

Tsodikov  (1998)  proposed  the  same  semiparametric  model  from  different 
angles.  As  above,  let  0 be  a function  of  covariates,  denoted  by  ^(x, /3).  Assume 
that  n is  the  number  of  failure  observations  and  6f  is  the  parameter  of  interest 
for  the  failure  observation.  Let  mi  be  the  number  of  censored  observations  in 
the  time  interval  (ti,ti+i),  where  U is  the  failure  time  associated  the  failure 
observation.  Let  , Urm  be  the  censoring  time  for  the  observations  censored 

within  interval  {U,  U+i).  Let  9^j{i  = 1, 2, . . . , n,  j = 1,2,...,  m^)  be  the  parameters 
of  interest  for  the  censored  observations.  Also  let  be  the  sum  of  9s  for  all 
individuals  on  the  interval  and  let 


for  0 < a < 5 < n.  Note  that  Ylt  = 0 Fla  = 1 if  o > 5.  The  form  of  F{t)  is 
completely  unspecified. 


S{t)  = e-^  + - e-^). 


b 


k=a 

b 
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In  order  to  draw  inference  on  the  cure  proportion  with  the  presence  of  an 
unknown  form  of  F{t),  the  author  applied  the  partial  likelihood,  the  marginal 
likelihood,  and  the  profile  likelihood  function  of  6.  The  partial  likelihood  function 
is  written  in  the  form 


r 

q q Q ■ 

The  partial  likelihood  is  valid  due  to  the  proportional  hazard  structure.  It  provides 
consistent  estimates.  However,  it  can  only  be  used  to  estimate  the  ratio  of  9 and 
9b,  the  baseline  value.  It  is  therefore  inconclusive  in  the  estimation  of  9b-  When 
there  is  no  covariate,  i.e.,  when  all  patients  are  homogeneous,  cure  rate  can  not  be 
estimated. 

Both  the  marginal  and  the  profile  likelihood  are  derived  from  the  full 
likelihood  function  which  is  written  as 


n rrii 

L = (1.12) 

j = l 

where  Q{t)  is  the  survival  function  for  the  censoring  time.  Type  I censoring 
mechanism  with  a fixed  end  point  tc  is  assumed  throughout  the  article.  Then  Q 
is  a single-step  function  such  that  Q{U)  = 1 for  z = 1, 2, . . . , n,  AQ{tc)  = 1,  and 
tnj  = tc  for  j = 1,2, . . . , nin-  The  marginal  likelihood  function  is  derived  as 


■ ■ ■ ^n,n  _ . r 


i-0 


The  profile  likelihood  function  is  derived  from  Equation  (1.12)  by  maximizing  it 
with  respect  to  F.  Nonparametric  maximum  likelihood  estimator  (NPMLE)  is  used 
to  estimate  F. 

Both  Chen  et  al.  (1999)  and  Tsodikov  (1998)  proposed  the  same  semipara- 


metric  model,  while  using  different  estimation  techniques.  The  former  assumed 
independent  and  noninformative  censoring  as  well  as  Weibull  distribution  for  F{t) 
in  the  model.  Therefore,  their  model  is  a parametric  model  although  F{t)  can  take 
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a nonparametric  form.  The  latter  assumed  a Type  I censoring  mechanism,  which  is 
restrictive. 

Both  the  parametric  and  the  semiparametric  models  with  a bounded 
cumulative  hazard  rate  are  appealing  because  of  the  simplicity  and  intuitive 
meaning  of  these  models.  However,  there  are  limitations  for  the  more  flexible 
semiparametric  model.  It  is  difficult  in  terms  of  computation.  In  addition,  the 
asymptotic  properties  of  this  model  have  not  been  explored  in  the  literature. 

In  the  next  section,  we  will  reexamine  the  semiparametric  model  proposed  by 
Chen  et  al.  (1999)  and  Tsodikov  (1998).  In  this  examination,  we  make  no 
parametric  assumption  for  F{t).  In  addition,  we  do  not  limit  the  censoring  to  Type 
I mechanism.  By  using  the  simulated  data,  we  show  the  computational  barrier 
raised  by  this  model. 


1.2  Computation  Difficulty  of  the  Semiparametric  Model 
We  assume  random  and  noninformative  censoring.  The  notations  used  in 
Tsodikov  (1998)’s  paper  are  used  in  the  section  except  some  changes  are  made  in 
the  definition  of  Pa,b  and  We  define  them  as  follows: 


Pa,b  = 

Sa,bPa+l,b  ' 

■ • Sb,b, 

II 

Pa,bPa,b—l 

■ ■ 

^a,a 

The  likelihood  function  takes  the  following  form, 

i=l 

where  f{ti)  is  the  density  function  corresponding  to  F{ti).  Define 


(1.13) 


/•  = (-1)' 


g — t+l,n 

pn-i+l,np 
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and 


Let  Lm  denote  the  marginal  likelihood  function.  As  derived  in  Appendix  A, 


Without  loss  of  generality,  assume  that  there  is  only  one  covariate  x in  the 
study,  such  as  the  treatment  indicator.  Also  assume  that  0 is  a function  of  the 
covariate  x and  is  expressed  in  an  exponential  term  as  exp  (a  + /?x).  The  first  and 
the  second  derivatives  of  with  respect  to  are  denoted  by  S'-j  and  S”^  They 
are  derived  as  follows; 


n 


(1.14) 


i=l 


The  score  function  for  a is  obtained  as 


n 


T* 


and  the  score  function  for  /?  is  written  as 


n 


n 


T* 


The  Hessian  matrix  is  calculated  and  written  as 


d‘^\og{Lm)/da'^  d'^\og{Lm)/dad/3 
d‘^\og(Lm)/dadp  d‘^\og{Lm)/d(3^ 


where 
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a^og(L„)/aaa/?  = 


i=0  " 


'ra— i+l,n 


and 


" i=0  " i=0 

where  Jf  is  the  first  derivative  of  J*  with  respect  to  j3  and  is  expressed  as 


Suppose  that  the  covariate  x is  the  treatment  indicator,  i.e.,  a:  = 1 if  the 
patient  is  in  the  treatment  group  and  0 if  the  patient  is  in  the  control  group.  We 
are  now  interested  in  testing  the  hypothesis: 


The  score  test  (Rao,  1973)  is  used  to  test  the  hypothesis  in  Equation  (1.15). 

The  score  statistic  for  (3  under  Hq  is  denoted  as  U{^)\ho  and  the  asymptotic 
variance  for  the  score  statistic  is  —5^  \og{L^)ld0^  under  the  null  hypothesis.  If  a is 
assumed  known,  then  the  standardized  test  statistic  is 


In  order  to  explore  the  feasibility  for  computation,  we  first  generate  10000 
samples  under  the  assumption  that  F is  an  exponential  distribution  with  a mean 


q;  = 1.5  and  /?  = 0.  Each  random  sample  contains  30  observations  in  the  control 
group  and  30  observations  in  the  treatment  group.  The  censoring  proportion  is 
around  50%.  The  standardized  score  statistic  for  (3  is  calculated  for  all  random 
samples  generated.  The  plot  of  the  10000  standardized  score  statistics  is  presented 
in  Eigure  (1-1).  In  this  bar  plot,  we  observe  that  the  score  statistic  behaves  well 
and  is  close  to  a normal  shape  given  that  a is  known. 


Ho:  P = 0. 


(1.15) 


umn. 


(1.16) 


v^-aMog(Z„)/a/?2|„„' 


of  1.  Then  the  survival  function  is  exp{—9F)  = exp(— (1  — e Assume 


20 

The  problem  becomes  difficult  to  handle,  however,  when  a is  unknown.  It 
is  common  to  use  a profile  likelihood  method  to  handle  the  nuisance  parameter 
a when  our  primary  parameter  of  interest  is  /?.  The  profile  marginal  likelihood 
function  for  /?  is  obtained  from  the  marginal  likelihood  function  by  replacing  a 
by  its  MLE  under  Hq.  The  test  statistic  based  on  the  profile  marginal  likelihood 
function  of  j3  is  similar  to  the  one  defined  in  Equation  (1.16)  given  that  some 
adjustment  on  the  denominator  of  Equation  (1.16)  is  required.  Therefore,  our 
primary  task  is  to  solve  for  the  MLE  of  a under  Hq.  By  using  the  same  parameter 
setting  for  Eigure  (1-1),  we  generate  a sample  with  30  patients  in  the  treatment 
group  and  30  in  the  control  group.  We  show  in  Figure  (1-2)  the  plot  of  the  log  of 
the  marginal  likelihood  function  as  a function  of  a on  interval  [—2, 6]  when  [3  is 
valued  at  0.  The  marginal  likelihood  for  a under  Ho  is  poorly  behaved.  In  order 
to  further  investigate  the  behavior  of  the  marginal  likelihood  function  on  a and 
/?,  we  show  in  Figure  (1-3)  a three-dimensional  plot  of  the  log  marginal  likelihood 
function  defined  based  on  the  same  random  sample  as  a function  of  a on  the 
interval  [—2,2]  and  (3  on  the  interval  [—3,3].  The  poorly  behaved  log  marginal 
likelihood  as  a function  of  a and  /?  explains  the  behavior  of  the  log  marginal 
likelihood  function  in  Figure  (1-2). 

1.3  Piecewise  Exponential  Model 

As  we  mentioned  earlier  in  the  chapter,  it  is  sometimes  necessary  to  assume 
different  hazard  functions  for  the  time  to  failure  on  different  time  intervals. 
Friedman  (1981)  proposed  a piecewise  exponential  model  to  handle  this  problem. 
Let  the  time  scale  be  divided  into  I{n)  intervals  (0,Ti],  (Ti,T2],  . . .,  (T/(„)_i, T/(„)]. 
The  hazard  rate  of  each  individual  is  assumed  to  be  constant  over  any  given 
interval.  Denote  the  hazard  rate  of  the  individual  in  the  interval  by  . 
Assume  Xij  > 0 for  all  {i,j).  Let  lij  = logA^  and  let  Uj  be  the  survival  time  of  the 


21 


individual  in  the  interval.  Define  Uj  as 


Uj  = max{0,min(Tj  - - Tj_i)}. 


(1.17) 


Let 


0,  otherwise. 


1,  when  the  individual  fails  in  [Tj_i,Ti) 


Then  the  log-likelihood  function  is 


logZ/(l)  ^ y ^ ^ exp ( l-ij ) . 


(1.18) 


ij  ij 


One  of  the  forms  can  take  is  lij  = a -t-  ^kXj- 

The  author  proved  the  existence  and  the  uniqueness  of  the  MLE  of  the 
vector  1.  The  consistency  and  the  asymptotic  properties  of  the  MLE  were  also 
explored.  Karrison  (1987)  also  considered  a piecewise  exponential  model  in  his 
study. 


Frequency 


22 


o 

o 

o 

cvi 


-4  -2  0 2 4 


Standardized  Score 


Figure  1-1.  Standardized  score  under  Hq  (/5  = 0)  for  10000  random  samples. 
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a 


Figure  1-2.  Log  marginal  likelihood  function  of  a when  ^ = 0. 


Marginal  Likelihood  Function 
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'i? 


Figure  1-3.  Log  marginal  likelihood  function  of  a and  /?. 


CHAPTER  2 
TOPIC  OF  RESEARCH 


The  model  we  propose  in  this  dissertation  is  a piecewise  Gompertz  model. 

It  incorporates  the  cure  proportion  by  assuming  a bounded  cumulative  hazard 
rate.  Moreover,  our  model  gives  the  flexibility  that  the  survival  times  for  different 
time  intervals  may  have  different  hazard  functions.  As  noted  by  Aitkin  et  al. 

(1989)  that  a piecewise  exponential  model  has  the  same  flexibility  as  a proportional 
hazards  model  under  the  assumption  that  each  interval  is  defined  by  one  failure 
observation  in  the  piecewise  model.  The  choice  of  using  a Gompertz  model  over 
an  exponential  model  is  made  because  of  the  possible  plateau  of  the  survival  curve 
from  a Gompertz  model.  We  would  speculate  that  under  a piecewise  Gompertz 
model,  the  more  intervals  we  define,  the  more  flexible  the  model  is.  Eventually, 
when  there  is  only  one  failure  observation  on  each  interval,  the  flexibility  of  this 
model  would  be  the  same  as  the  semiparametric  model  we  discussed  in  Chapter  1. 
Therefore,  our  model  preserves  the  flexibility  of  a semiparametric  model  to  some 
extent.  The  degree  of  the  flexibility  depends  on  the  number  of  intervals  we  define 
for  our  model.  Meanwhile,  the  computation  for  our  model  is  much  easier  to  handle 
than  a semiparametric  model  due  to  our  model’s  parametric  structure.  However, 
when  the  number  of  intervals  in  our  model  increases,  the  number  of  individuals 
on  each  interval  decreases.  We  would  anticipate  large  variance  for  the  estimate 
of  the  cure  rate,  which  is  similar  to  the  situation  we  discussed  earlier  for  the  KM 
estimate  of  the  cure  rate.  An  appropriate  number  of  intervals  needs  to  be  accessed 
in  order  to  reach  maximum  flexibility  and  computational  feasibility  of  the  model. 

In  this  chapter,  we  describe  our  model  in  detail.  In  Chapters  3 and  4,  asymptotic 
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properties  of  the  MLEs  of  parameters  in  the  model  are  proved.  In  addition,  a 
goodness-of-fit  test  is  derived  which  can  be  applied  to  determine  the  appropriate 
number  of  time  intervals  in  the  model.  In  Chapter  5,  a Monte-Carlo  simulation 
study  is  carried  out  to  show  the  advantages  of  our  model.  Chapter  6 gives  a real 
example  and  demonstrates  the  application  of  our  model  to  the  example. 

2.1  Data  Structure  and  Model 

Assume  there  are  n patients  in  the  study.  Consider  the  patient  who  is 
in  treatment  group  g and  is  associated  with  a vector  of  covariates  Xj,  where  Xj  is  a 
p-dimensional  vector.  Let  Tj  and  Uj  be  the  survival  time  and  the  censoring  time  of 
the  patient,  respectively.  The  observed  data  on  the  patient  are  (t*,6j,Xj), 
where  t*  is  the  observation  on 

Tj  = mm{Tj,  Uj), 

and  Sj  is  the  observation  on 

A,  = I{T,  < Uj). 

Assume  there  are  G treatment  groups  and  the  interval  [0,  oo)  is  divided 
into  / intervals  [ri_i,ri),  i = 1,  2, . . . ,/,  where  tq  = 0 and  tj  = oo.  Let 
0^  = {0^ , A^),  where  (3  is  p-dimensional  and  A = (A^,  A^, . . . , A^)^.  Note  that 
Ag  = (Aig,  A23, . . . , \ig).  Clearly,  0 is  a {p  + 7G)-dimensional  vector.  It  is  assumed 
that  the  hazard  function  for  the  patient  in  treatment  group  g has  the  form 

tG[ri_i,Ti).  (2.1) 

Define  an  indicator  function  Kg{j),  where 

{1  if  patient  is  in  group  g; 

0 otherwise. 
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Let  K{j)  be  a vector  of  Kg{j)  for  ^ = 1, 2, . . . , G,  i.e., 


K{j)  = («l(i),«2(j),-..,KG(i))^ 


Let  us  also  define 


for  i = 1, 2, . . . , 7 and 


Mt) 


1 if  t € [ri_i,rj); 
0 otherwise, 


Let 


Q{t,j)  = K{j)  0 J(t)  = 


K2(i)JW 


\KG{j)J{t) 


Note  that  Q(t,  j)  is  a G/-dimensional  vector.  The  hazard  function  in  Equation 
(2.1)  is  also  written  as 

if  «g(i)  = 1 and  Jj(t)  = 1. 

The  cumulative  hazard  function  Hj{t,9),  the  survival  function  Sj{t,6),  and  the 
density  function  fj  (t,  0)  can  be  derived  as  follows: 

Hj{t-,0)  = [ h(u;  A,  Ac(j),  J(u))du, 

Jo 

Sj{t]0)  = exp{— f h{u-,X,K{j),J{u))du}, 

Jo 


The  likelihood  function  contributed  by  (tj,5j,Xj)  is  expressed  as 
L„,  = A,  ((•;»). 
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The  corresponding  log-likelihood  function  is  written  as 


Inj 


\ogLj 


= 6j[\og{h{t*]  X,  K{j),J{t*)))  + /3'^y.j]  - ['  h{u-,X,K{j),J{u))du. 

Jo 


(22) 


consequently,  the  overall  likelihood  function  is  defined  as 

n 

Ln  — L'nj ) 
i=i 

and  the  corresponding  log-likelihood  function  is 


^ ^ ^nj  ■ 

t=l 


2.2  Assumptions 

Our  derivation  of  asymptotic  properties  is  based  on  the  following  three 
assumptions: 

Assumption  1.  There  is  a 5 > 0 such  that  ||xj||  < J5  for  j = 1, 2, . . . , n. 
Assumption  2.  There  exist  e > 0,  5 > 0,  and  an  integer  no-  when  n > 
no,  there  exists  a subset  A of  overall  individuals  with  n^/n  > 5,  where  ua 
is  the  size  of  the  subset  A such  that  for  any  unit  vector  a, 

I a^Xj  |>  e for  G A. 

Assumption  3.  Let  Ug  be  the  number  of  individuals  in  treatment  group  g, 
where  g = 1,2, ...  ,G.  There  exists  a (5  > 0 such  that  mini<p<G  ^ > 6 when  n 
is  sufficiently  large. 

Assumption  4.  The  censoring  variables  Ui,  U2, . . . ,Un  are  iid  with  a survival 
function  Gu{u).  Assume  that  there  exists  an  e > 0 such  that 


Gv{ti-i)  — Gu{oo)  > e. 
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The  first  assumption  assumes  that  all  covariates  are  uniformly  bounded. 

It  is  reasonable  in  practice  because  the  majority  of  the  covariates  in  a cancer 
study  are  patients’  characteristics,  which  are  bounded  uniformly.  Note  that  we 
are  primarily  interested  in  time-independent  covariates.  The  second  assumption 
assumes  that  at  least  a set  A of  patients’  covariates  are  dispersed  to  some  extent. 
Assumption  3 means  that  the  number  of  observations  in  each  treatment  group  must 
increase  proportionally  to  the  sample  size.  Assumption  4 assumes  that  not  all  the 
censoring  observations  occur  very  early  on  the  time  span. 

2.3  Hypothesis  of  Interest 

We  are  interested  in  comparing  cure  rates  among  treatment  groups.  Let 
7Tg(x)  be  the  cure  rate  of  patients  who  are  under  treatment  group  g and  who  are 
associated  with  covariates  vector  x.  Considering  the  case  with  two  treatment 
groups,  the  null  hypothesis  is 


Ho  : iTi(x)  - ?T2(x)  = 0, 


(2.3) 


for  any  covariate  x.  The  test  statistic  will  be  derived  in  the  following  section. 


2.4  Test  of  Hypothesis 

The  cure  rate  corresponding  to  patients  in  treatment  group  g who  are 
associated  with  a specific  setting  of  covariates  x is 

poo 

7Tp(x)  = S{oo\  6)  = exp{— / h{u\  A,  K{j),J{u))du} 

Jo 

= exp{e^"[^ + — ]}. 

i=l  ^*9  '^^9 

The  hypothesis  defined  in  Equation  (2.3)  is  equivalent  to 


(2,4) 


H, 


i=l 


+ - 


A,: 


A 
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-}-{E 

t=l 


^ ^ gAj2Ti-l  gA/2T/_i 

+ 


A, 
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A 


0. 
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/V  T ''  T 

Let  the  MLEs  oi  6 = (A^  0^Y  be  denoted  by  0 = (A  ^ )^.  The  test  statistic  is 
derived  as 


i=i  i=i  \2  a/2 

The  variance  of  the  above  test  statistic  can  be  derived  by  the  Delta  method.  The 
test  statistic  given  in  Equation  (2.5)  is  a function  of  the  estimates  of  A.  We  will 
show  in  later  chapters  that  6 follows  an  asymptotic  normal  with  a mean  d and  a 
variance-covariance  matrix  S.  The  variance  estimate  of  the  test  statistic  can  be 
derived  as  follows: 

V(T)  = (f 

where  (|^  Ia=a)  represents  the  derivative  of  the  test  statistic  with  respect  to  A 
while  evaluated  at  the  MLE  of  A.  E*  is  the  variance-covariance  matrix  of  A and  E* 
is  the  estimate  of  E*.  The  vector  is  derived  as  follows: 


dT 

dXii 

dT 

dXi2 


A?/ 


A?2 


for  i = 1, 2, — 1,  and 


dT  _ (A/ir/_i  — 

dT  _ (A/2T/_i  - 

5A/2  A^2 

The  normalized  test  statistic  is 


it  follows  an  asymptotic  standard  normal  distribution. 


CHAPTER  3 

ESTIMATION  OF  PARAMETERS 


3.1  Likelihood  Function  and  Information  Matrices 


Based  on  the  log-likelihood  function  defined  in  Equation  (2.2),  the  score 
statistic  Snj{0)  for  the  individual  can  be  expressed  as 


Sn,W  = 


(3.1) 


where 


Snj{0)  = ^ ^ ' h{u;  A,  K{j),J{u))du  (3.2) 


and 


5g’(«) 


^ P Q{u,j)uh{u-,X,K(j),3{u))du.  (3.3) 

Note  that  S^j{0),  and  Snj{6)  are  p,  IG,  and  {p  + 7G)-dimensional  vectors, 

respectively.  Furthermore,  the  matrices  formed  by  second  order  derivatives  of  /„j 
can  be  derived  as 


_ r 


= - 


dl3d0 


FifhO)  = 


^ ^ d(3dX^  ^ 

dH 


dXdl3^ 


/ h(u;  A,K(i),  J(u))du, 

Jo 

' / ^ QF{u,j)uh{u;X,K{j),J{u))du, 
Jo 

Q(u,  j)xjnh(u;  A,K(i),  J(u))du, 


and 


dXdX^ 


[ Q{u,j)Q^{u,j)u‘^h{u;X,K{j),J 
Jo 


[u))du. 
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Then  the  observed  Fisher’s  information  matrix  is  written  as 

dt  ’(»)  y 

f-t”. 


3^^^^  ['  h{u-,X,K{j),J{u)) 
Jo 


i 


\ 


^uQ{u,j)  j 


(x[  uQ^(u,j))du,  (3.4) 


which  is  a (p  + IG)  x {p  + IG)  matrix.  The  Fisher’s  information  matrix  is  the 
expectation  of  Fnj{6),  which  is  written  as 


pT* 

( \ 

= E{Fr,j{e))  = 

/ h{u;X,K{j),J{u)) 

^Jo 

\uCl{uG)  ^ 

uQ^(u,j))du  . 


(3.5) 


3.2  Concavity  of  Likelihood  Function 
The  log-likelihood  function  for  n individuals  is  written  as 

^n(^)  = '^{5j[\og{h{t*]X,K{j),3{t*)))  +/3'^Xj]  - / h{u-X,K{j),J{u))du}. 

j=i 

(3.6) 

The  score  statistic,  the  observed  Fisher’s  information  matrix,  and  the  Fisher’s 
information  matrix  for  n individuals  in  the  sample  are  respectively  as  follows: 


n 


5„(9)  = ^S„,{9), 
f=l 

(3,7) 

n 

F„(9)  = ^F„,(9), 

f=l 

(3.8) 

and 

Tt 

Dfi  — ^ ^ ^nj  • 

j=l 

(3.9) 

Let  = (a^  b^),  where  a is  a p-dimensional  vector  and  b = {bn  • 

••  bn  ■ 

• • biG 

• ■ • bio)^  is  a JG-dimensional  vectors.  We  will  show  that  Fn{6)  is  positive  definite 


and  therefore,  the  log-likelihood  function  defined  in  Equation  (3.6)  is  strictly 
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concave.  To  show  Fn{6)  is  positive  definite,  it  suffices  to  show  that  c^F„(0)c  > 0 
for  any  non-null  vector  c of  dimension  {p  + IG).  In  order  to  prove  that  F„(0)  is 
positive,  we  assume  that  there  is  at  least  one  observation  on  each  interval  for  each 
treatment  group.  Note  that 


^Fn{e)c 


/ \ Tl 

( \ 

(a’- 

a 

V / j=l 

= f /i(w;  A,k(j),  J(u))[a^Xj 

j=i 

> 0. 


uh^Q{u,  j)Y  du 


This  implies  that  F„(0)  is  semipositive  definite.  Since  /i(u;  A, /c(j),  J(u))  > 0, 
c^Fn{0)c  = 0 implies  that  for  any  u 6 (0,  tj),  j = 1,  2, . . . , n, 

a^Xj  4-  hFQ{u,j)u  = 0.  (3.10) 


For  any  integer  i,  1 < i < I and  for  any  integer  g,  I < g < G,  there  exists  an 
individual  j (say)  in  group  g and  t*  G (rj_i,Ti).  Then  Equation  (3.10)  implies  that 

a^Xj  -I-  ubig  = 0 for  u G 


Consequently 


a^Xj  = 0 


and 

kg  = 0.  (3.11) 

Since  Equation  (3.11)  holds  for  any  integer  i,  1 < i < I and  g,  1 < 5 < G,  it  is  then 
proved  that 


b = 0. 
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Therefore,  Equation  (3.10)  implies  that  for  any  j — 1, 2, . . . , n, 


= 0, 


i.e., 


-Xj  = 0 for  all  j = 1,2, 


,n. 


which  is  a contradiction  to  Assumption  2. 

Therefore,  Fn{9)  is  positive  definite.  The  strict  concavity  of  the  log- 
likelihood  function  as  a function  of  unknown  parameters  0 guarantees  the 
uniqueness  of  the  MLEs  of  the  parameters  provided  that  the  MLEs  exist.  Based 
on  the  existence  of  the  MLEs  and  the  concavity  of  the  log-likelihood  function,  the 
Newton-Raphson  algorithm  converges. 


CHAPTER  4 

PROPERTIES  OF  THE  MAXIMUM  LIKELIHOOD  ESTIMATES 

4.1  Introduction 

In  the  previous  chapter,  we  proved  the  strict  concavity  of  the  log-likelihood 
function.  We  pointed  out  that  we  obtain  unique  MLEs  of  parameters  if  we  can 
prove  that  these  MLEs  exist.  In  this  chapter,  we  will  first  provide  some  preliminary 
results  as  a preparation  for  later  derivations.  Then  we  will  show  the  existence,  the 
consistency,  and  the  asymptotic  normality  of  the  MLEs  obtained.  The  method 
applied  to  prove  the  properties  of  the  MLEs  follows  a similar  method  provided 
by  Mailer  and  Zhou  (1996,  PP21 1-222).  Mailer  and  Zhou  proved  the  asymptotic 
properties  of  MLEs  in  a survival  model  which  assumes  that  the  failure  time  follows 
an  exponential  distribution  with  covariates  information.  The  exponential  model 
with  covariates  in  their  book  is  a special  case  of  the  Gompertz  model.  By  studying 
the  piecewise  Gompertz,  we  move  one  step  further  through  the  incorporation  of 
different  hazard  functions  on  different  time  intervals.  In  our  model,  if  we  allow 
A = 0,  the  hazard  function  for  the  failure  time  T is  h{t)  = Therefore,  the 
hazard  is  a constant,  which  implies  that  T follows  an  exponential  distribution. 

In  the  next  section,  we  state  several  lemmas  which  will  be  applied  to  prove 
the  existence,  the  consistency,  and  the  asymptotic  normality  of  the  MLEs.  The 
proofs  of  the  lemmas  are  given  in  detail  in  the  Appendices.  We  prove  the  existence 
and  the  consistency  of  the  MLEs  in  Section  4.3.  The  proofs  of  the  asymptotic 
normality  is  stated  in  Section  4.4.  In  Section  4.5,  a goodness-of-fit  test  which 
compares  models  with  different  numbers  of  pieces  is  derived.  We  also  show  that  the 
corresponding  test  statistic  follows  an  asymptotic  distribution. 
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4.2  Preliminary  Proofs 

Lemma  4.1.  Let  E{-)  and  F(-)  represent  the  expected  value  and  the  variance 
of  a random  variable,  respectively.  Then  the  following  two  equations  hold: 

E{5„(6>)}  = 0 and 
V{Sn{d)}  = D,. 


Proof.  See  Appendix  B.  □ 

Lemma  4.2.  Let  6q  = (/3g,  Aq)  be  the  true  parameter  vector,  where  Aq  = 

, . . . , Xfi, . . . , Ajq,  . . . , Xf^).  For  any  positive  integer  m and  for  I = 1,2, 
there  exist  an  e > 0 and  a F > 0 such  that  for  any  A satisfies  ||A  — Ao||  < e, 

pT-  I 

Eeo{  ' rt(u;  A',k(j),  J(u))du}  < F, 
for  _;■  = 1, 2, . . . , n.  Note  that  F is  a constant  which  depends  on  and  e. 


Proof.  See  Appendix  C. 


□ 


Lemma  4.3.  Define  0q  as  in  Lemma  4.2.  For  any  positive  integers  I and  m, 
there  exists  a 7 > 0 such  that 

u'^h^{u\Xo,K{j),J{u))du}  > (4.1) 

for  j = 1, 2, . . . , n.  Note  that  7 is  a constant  depending  on  Oq. 

Proof.  See  Appendix  D.  □ 


Lemma  4.4.  Under  Assumptions  2,  3,  and  4,  there  exist  a 7 > 0 and  an  integer 
no  > 0 such  that  when  n > no, 


Amin(F^n) 


n 


> 7, 


where  Amin(D„)  is  the  smallest  eigenvalue  of 
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Proof.  See  Appendix  E.  □ 

4.3  Existence  and  Consistency  of  the  MLE 
For  each  A > 1,  define  an  elliptical  neighborhood  of  the  true  value  do  of 
unknown  parameters  by 

N{do)  = {e:{e-  OoVDnie  - do)  < a}.  (4.2) 


As  shown  in  Lemma  4.4,  Amin(-Dn)  > when  n > no-  Hereafter  we  assume  that 
the  condition  n > no  is  satisfied.  We  observe  that  for  any  d G N{do), 


A>[d-do)'^Dn[d-do)  > \\d-dofXr^n{Dn) 

> Wd-dofn'y, 


i.e., 


|0-0o||  < 


^/n^j  7' 

Let  us  consider  an  arbitrary  vector  d which  is  on  the  boundary  dN{do)  of  N{do) 
then  0 is  a vector  which  satisfies  the  following  equation: 


(4.3) 


(9  - e„fD„(9  - e„)  = >1. 


If  we  can  show  that  ln{d)  < Ini^o)  for  all  d G dN{do)  with  probability  approaching 
1,  then  the  log-likelihood  function  has  a local  maximum  in  N (do).  In  addition,  the 
gloabl  maximum  exists  and  lies  in  N{do)  since  the  log-likelihood  function  is  strictly 
concave.  The  consistency  of  is  proved  by  observing  that  Equation  (4.3)  holds. 
Theorem  4.1.  There  exists  a unique  and  consistent  which  maximizes  ln{d). 

Proof.  For  any  G dNn{do),  the  Taylor’s  expansion  of  the  log-likelihood  function 
about  00  is  written  as 

IniOn)  - IniOo)  - (0„  - 0q)^5„(0o)  " ^(^n  ~ 0q)"’T;(0~  ) (0„  - do),  (4.4) 
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where  is  a random  vector  on  the  line  segment  between  and  Oq.  Hence,  the 
vector  OnE  N{9o).  Recall  that  5„(0o)  is  the  vector  of  the  first-order  derivatives  of 
ln{9)  evaluated  at  the  true  parameter  do  and  Fn{0n)  is  the  negative  matrix  of  the 
second  derivatives  of  ln{0)  evaluated  at  We  have 


- ln{Oo)  < o| 

> P{{0n  - 0ofSn{0o)  < \{en  - 0of  Fr,{0n){0n  ~ ^o)} 

> P[{0n  - 0ofSn{0o)  < \a,  - 0oVFn{0n){0n  ~ ^o)  > 

> P[{0n  - 0oVFn{0r,){0n  - 0o)  > \a}  - P{(0„  - 0ofSn{0o)  > \a]  . 

We  proved  in  Lemma  4.1  that 

E{Sr,{0o))  = 0 


and 


H(5„(0o))  = L»n 


Since  (0„  — 0o)^Sni0o)  is  a sum  of  independent  random  variables  with  mean  zero, 
we  obtain  the  following  inequality  by  applying  the  Chebyshev’s  inequality: 


P{{en-0ofSn{0o)>^}< 


16(6>„  - 0ofDn{0n  - 0o)  _ 16 

A’ 


(4.5) 


since  G dN(0o).  The  probability  in  Equation  (4.5)  can  be  made  arbitrarily  small 
by  choosing  A sufficiently  large.  Next  we  need  to  show  that 

P{(0n  - 0ofFn(0n)(0n  ~ Oq)  > \a]  ^ 1,  (4.6) 

as  n — )■  oo. 

Let  I be  a (p  -t-  7G)  x {p  + 7(7)-dimensional  identity  matrix.  If  we  can  show 

r\j 

that,  uniformly  in  N{0o), 


A I,  as  n -»  oo, 


(4.7) 
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then  we  obtain  the  following: 

(0„-0orF„(0i)(0„-0o) 

= {9n  - - 6o) 

> {0n  - 0ofDn{0n  - 0,)\rrnn{D-^'^Fr,{0n)D-^l^) 

= AK,,{D-^l'^Fr,{0n)D-^'^) 

4 A, 

where  G dN{0o)  and  0„G  N{0q).  Therefore,  Equation  (4.6)  holds  as  n — >■  oo. 
Consequently,  if  we  let  both  n and  A become  large, 

P{ln{0n)  < in{0o)}  1 for  0n  ^ dN{0o). 

Unique  MLEs  of  parameters  in  the  model  exist  by  the  strict  concavity  of  the 
log-likelihood  function  and  are  consistent  by  Equation  (4.3).  □ 

We  shall  now  prove  Equation  (4.7)  through  the  following  three  lemmas. 

Note  that  Equation  (4.7)  indeed  shows  that  the  information  matrix  is 
consistently  estimated  by  F„(/9„).  Let  = (a^,b^)  be  any  {pF  /G')-dimensional 
unit  vector,  where  a and  b are  p-  and  /G-dimensional,  respectively. 

Lemma  4.5.  Eor  any  (p  -I-  7G')-dimensional  unit  vector  c, 

^D-^l^\F„(e,)  - A 0,  (4.8) 

where  $o  = (/^o  ) FV  is  the  true  parameter. 

Proof.  See  Appendix  E.  □ 

Lemma  4.6.  Eor  any  (p  4-  7G)-dimensional  unit  vector  c and  any  0 G N{0q), 

c^L>-i/2(F„(0)  - Fn{0o))D-^^^c  A 0. 


Proof.  See  Appendix  G. 


□ 
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Lemma  4.7.  For  any  0 G N{Gq) 

A I(p+/G)x(p+/G)  (4.9) 

Proof.  See  Appendix  H.  □ 

4.4  Asymptotic  Normality  of  the  MLE 

Lemma  4.8.  Let  I be  an  arbitrary  positive  integer,  then  EgQ\\Snj{0)\\‘  is  uniformly 
bounded  for  all  j = 1,  2, . . . , n.  Note  that  Snj{0)  is  defined  in  Equation  (3.1). 

Proof.  See  Appendix  I.  □ 

Lemma  4.9.  For  any  {p  + /G')-dimensional  unit  vector  c, 

c^D-^^^Sn{eo)-^N{0,l).  (4.10) 

Proof.  Denote 

z„  = 

1=1 

where  Snj{0o),  j = 1, 2, . . . , n are  independent  random  variables.  By  Lemma  4.1 

E{Zn)  = 0 

and 

V{Zr,)  = 1. 

By  Liapounov’s  condition,  it  suffices  to  show  that 

1=1 

as  n — >•  oo.  Note  that 


(411) 
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Since  £'e(,||S'„j(0o)||^  is  uniformly  bounded  for  all  individuals  by  Lemma  4.8, 
Equation  (4.11)  holds  and  the  proof  is  accomplished  by  the  Central  Limit  Theorem. 

□ 

Theorem  4.2.  The  MLE  are  asymptotic  normal,  i.e., 

D]^‘^{dn  — 0o)  ^ N{0,  I)  as  n ^ oo. 

Proof.  By  Taylor’s  expansion,  we  obtain  that  for  some  9 on  the  line  segment 
between  On  and  Oq  and  for  any  unit  vector  u, 

n^D-^I^Sn{K)  - n^D-^I^SniOo)  - - 9o). 

Also  because  \x^ Dn^^^Sn{9n)  = 0 since  is  the  MLE, 

n^D-y^Sn{9,)  = M^D-^'^FnmTn  ~ 9,) 

= n^D-^l\Fn{9)  - Dn){Tn  - 0o)  + n^D-^I^Dn{Tn  - 0o) 

= n^D-^l\Fn{9)  - Dn)D-^l‘^D]l\Tn  - 0o)  + n^D]!\Tn  - 9o) 

= n^[D-^l\Fn{9)  - Dn)D-^l^]D]l^{rn  - 0o)  + Vi^D]!\Tn  - 0o). 

Since  G N{9q),  Equation  (4.9)  holds  by  Lemma  4.9.  Therefore,  the  sum  of  the 
absolute  values  of  all  elements  of  [T)n^^^(F„(0)  — D„)Dn^^^]  goes  to  0 in  probability. 
Hence, 

= o^, 

where  Op(l)  is  a matrix  with  all  elements  converging  to  0 as  n oo.  Then  we 
obtain  that 

u^D-^/^Sn{9o)  = u^[0p(l)]F>y2(^  _ ^ _ 0,). 


Therefore, 


- 0,)  u^D-^^^Sn{9o)  + Op(l). 


42 


Since  we  have  proved  the  normality  of  in  Lemma  4.7,  the  normality 


of  Dn^^‘^{6n  — 9q)  follows.  Hence,  Theorem  4 .2  is  proved. 


□ 


4.5  Goodness-of-Fit  Test 


In  practice,  one  might  be  interested  in  knowing  how  many  intervals  are 
in  fact  necessary  in  a specific  study.  Therefore,  a goodness-of-fit  test  can  be 
performed  to  test  how  many  time  intervals  are  adequate.  Assume  that  we  have  a 
saturated  model  with  A + 12  intervals  and  an  unsaturated  model  with  I\  intervals. 
We  are  interested  in  testing  whether  there  is  significant  difference  between  the 
two  models.  We  assume  that  the  unsaturated  model  is  obtained  by  combining 
intervals  A + 1,  7i  + 2,  . . .,  7i  + 12  in  the  saturated  model  with  interval  7i.  The  null 
hypothesis  can  be  written  as 

Ho  - A(/j+i)p  = A(/i+2)g  = • • • = A(/i+/2)g  = A/jg,  V5' = 1,  2, . . . , G.  (4.12) 
Since  6^  = {0^  A^),  the  null  hypothesis  defined  above  can  also  be  written  as 


77o  : 770  = 0, 


(4.13) 


where  77  is  a design  matrix  with  full  row  rank  I2G  and  is  written  as 


0 


0|;_i  0 0 0 

0 0 0 


0 


0 


-1 


Of^_i  0 0 0 


0 


0 


0 


or,-i  1 -1  0 

Of,-i  1 0 -1 


0 


0 


0 0 0 


0 


Ot.-I  1 0 0 
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where  Os  in  bold  are  null  vectors.  Note  that  the  first  p columns  of  the  matrix 
H put  no  restriction  on  (3.  Columns  (p  + 1)  through  (p  + 7i  — 1)  represent  the 
nonrestrictiveness  of  A for  treatment  group  1 corresponding  to  the  first  /i  — 1 
intervals.  Similar  patterns  are  presented  for  the  rest  of  the  columns.  We  will  now 
construct  a likelihood  ratio  test  for  the  hypothesis  in  Equation  (4.13).  To  maximize 
the  log-likelihood  function  ln{G)  under  Hq  is  equivalent  to  maximize 

U9)  - 

unrestrictively  in  0 € i?[p+Ci+^2)G]  ^ ^ where  7 is  a vector  of  Lagrange 

multipliers. 

If  we  let  9(1  be  the  true  parameters,  9n  be  the  parameters  estimated  under 

^ • T 

Hq,  and  be  the  estimated  parameters  under  the  saturated  model,  then 
satisfies  the  following  equation: 

Snm-H^  = 0,  (4.14) 

where  7 is  a vector  of  the  estimated  7 under  Hq.  Also  under  the  null  hypothesis, 
H9ji  = 0 and  H9q  = 0, 

H{9n  - 9q)  = 0.  (4.15) 

Now  we  need  to  solve  for  9n  from  Equations  (4.14)  and  (4.15).  By  Taylor’s 
expansion, 

- So)  + 0,(1), 

hence 

D]l\9n  - 9q)  + D-V25„(0„)  = D-^/^Sn{9o)  + 0,(1). 

By  Equation  (4.14),  the  above  equation  can  be  written  as 


D'Ae.  - «o)  + = D-'/^S„(e,)  + Op(l). 


(4.16) 
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We  can  rewrite  Equation  (4.15)  as 

- 9„)  = 0. 

Now  express  Equations  (4.16)  and  (4.17)  in  a matrix  formula, 


(4.17) 


( 


1-1/2  \ / ^ 


Dn  ' 


HD, 


-1/2 


0 


/ 


V 


7 


/ 


J-^n 

\ 


5n(6>o)  + Op(l) 

0 


(4.18) 


/ 


Since  H is  of  full  row  rank  DG  and  D„  is  nonsingular,  is  of  full 


rank  I2G.  Hence,  the  inverse  of  matrix 
written  as 


HD, 


-1/2 


0 


exists  and  can  be 


/ 

1 

1 

^ I -P 

Q ^ 

° ) 

\ 

V 

where 


Q = and 

R = 


Therefore,  we  can  solve  Equations  (4.18)  as 


^ E>y'(0„  - 9o)  ^ 

^ I- P 

V ^ ) 

^ R j 

\ 

T>n'^n(6>o)  + Op(l) 

0 


\ 


hence, 

D\l‘^(en  - Oo)  = (/  - P)D-^^^Sn{do)  + Op(l). 


Similarly  for  0„,  we  have 


Dl^^iOn  - do)  = D-^l‘^Sn{9o)  + Op(l), 
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by  Taylor’s  expansion  and  by  observing  the  fact  that  = 0.  Therefore, 


D]!\9n-en)  = D]i\e^-e,)-Dll\9r,-e, 
= PD-^I^Sn{9,)  + o,{l). 


(4.19) 


Note  that  PP  = P is  idempotent  with  full  rank  /2G';  therefore,  P can  be  written  as 

A, 


P = A^ 


( I 0^ 


V 


0 0 


/ 


where  A is  an  orthogonal  matrix.  Let  then 


z„4iV(o,/), 


by  Lemma  4.9.  From  Equation  (4.19),  we  obtain  that 


(0„  - 9^^ Dn{9n  - 9n) 

= S^{9o)D-^/^PD-^^^Sr^i9o)  + Op(l) 


= S'^{9o)D-^/^A^ 


^ I 0^ 
t/sG  U 


V 


0 0 


AP-i/2p„(0o)  + o^(l) 


Zi 


( 1 0^ 
P%G  U 


/ 


0 0 

4g  0 

0 0 


Zn  + Gp(l) 


Z + Op(l) 


hG- 


(4.20) 


The  likelihood  ratio  test  statistic  for  the  hypothesis  in  Equation  (4.12)  can 
be  written  as 


A 


SUPegeo  ^n{Q)  ^ Ln{9n) 
supee©T„(6>)  ” L„(0„)’ 
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where  © represents  the  overall  parameter  space  and  ©o  the  parameter  space  under 
Hq.  By  Taylor’s  expansion,  we  can  express  — 21og(A)  as 

-2l0g(A)  = -2[ln{dn)  - ln{K)] 

= -2[(0„  - - OnfFr^mOn  ~ K). 

where  6 lies  between  and  On  and  therefore  0 G Nn{A).  Note  that  Sn{6n)  = 0 
and  Fn{9)  can  be  replaced  by  from  Lemma  4.7.  Hence,  — 21og(A)  is  written  as 

-2  log(A)  = [On  - OnfDniOn  ~ On)  + Op{l), 

which  converges  in  distribution  to  ^7  Equation  (4.20). 


CHAPTER  5 
SIMULATIONS 

5.1  Introduction 

The  results  discussed  in  the  previous  chapters  showed  the  asymptotic 
properties  of  parameter  MLEs  in  the  piecewise  Gompertz  model.  These  results 
enable  us  to  carry  out  the  hypothesis  test  of  which  the  null  hypothesis  is  proposed 
in  Equation  (2.3).  In  this  chapter,  Monte  Carlo  simulation  studies  are  performed 
in  order  to  evaluate  the  normality  approximation  of  the  proposed  test  statistic 
in  Equation  (2.6)  and  the  power  of  the  test  under  different  settings  based  on 
the  amount  of  location  shift  and  the  sample  size.  Due  to  the  complexity  of 
the  piecewise  model,  the  procedure  for  generating  the  random  sample  is  quite 
complicated  and  is  discussed  in  section  5.2.  Results  are  given  on  observed  Type 
I errors  and  powers  for  both  the  piecewise  Gompertz  model  and  the  exponential- 
logistic  mixture  model.  All  simulations  were  completed  on  the  Sun  Ultra, 
Enterprise  450  computer  using  Ox  2.0  (Doornik  1998).  Random  number  generators 
in  the  Ox  2.0  environment  were  used  to  generate  uniformly  distributed  random 
numbers. 


5.2  Generation  of  Random  Samples 

In  this  section  we  describe  the  procedure  for  generating  the  random  sample 
when  the  sample  follows  a piecewise  Gompertz  model  based  on  certain  parameter 
settings.  In  order  to  generate  a random  sample  of  size  n with  a specified  proportion 
of  censoring,  we  need  to  generate  a sample  of  failure  observations  of  size  n and 
a sample  of  censored  observations  of  size  n.  The  sample  of  failure  observations 
follows  the  piecewise  Gompertz  model.  The  sample  of  censored  observations 
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is  assumed  to  follow  a uniform  (0,  M)  distribution,  where  M is  defined  by  the 
censoring  proportion.  The  final  sample  is  obtained  by  comparing  the  two  samples 
generated.  A subject  is  considered  failed  if  its  observation  from  the  first  sample 
is  smaller  than  the  one  obtained  from  the  second  sample.  The  subject  is  censored 
if  its  observation  from  the  first  sample  is  greater  than  the  one  from  the  second 
sample. 

5.2.1  Parameter  Settings 

We  consider  five  different  parameter  settings  in  our  Monte  Carlo  simula- 
tions. There  is  no  treatment  difference  in  random  samples  generated  based  on 
Settings  1 and  2.  Therefore  we  can  compare  Type  I errors  between  tests  based  on 
random  samples  generated  from  these  two  settings.  In  parameter  Settings  3,  4,  and 
5,  treatment  effects  exist  between  groups.  Settings  3 to  5 present  the  smallest,  the 
intermediate,  and  the  largest  difference  between  treatment  groups,  respectively. 

The  comparison  of  powers  of  tests  are  based  on  random  samples  generated  from 
these  three  parameter  settings.  All  five  parameter  settings  are  listed  as  follows: 
Setting  1: 

• Two  treatment  groups 

• Three  intervals:  [0,3),  [3,7),  and  [7,  oo) 

• One  covariate  with  value  0 or  1 

• Cure  proportion  varied  with  covariate  values 

• Censoring  rates:  30% 

• Aio  = All  = —4.2 

• A20  = A21  ==  —0.52 

• A30  = A31  = —0.2 

• /?o  = -0.1,  A = 0.2 

Setting  2: 

• Two  treatment  groups 
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Three  intervals:  [0,4),  [4,10),  and  [10,  oo) 

One  covariate  with  value  0 or  1 

Cure  proportion  varied  with  covariate  values 

Censoring  rates:  30% 

-^10  = = —3.5 

A20  = A21  = —0.45 
A30  = A31  = —0.15 
/3o  = -0.2,  A = 0.15 
Setting  3: 

Two  treatment  groups 

Three  intervals:  [0,3),  [3,7),  and  [7, 00) 

One  covariate  with  value  0 or  1 

Cure  proportion  varied  with  covariate  values  and  treatment  groups 
Censoring  rates:  30% 

Aio  = —3.5  and  An  = —4.2 
A20  = —0.45  and  A21  = —0.52 
A30  = — 0.21  and  A31  - — 0.2 
/3o  = -0.1,  /?i  = 0.2 
Setting  4: 

Two  treatment  groups 

Three  intervals:  [0,4),  [4,10),  and  [10, 00) 

One  covariate  with  value  0 or  1 

Cure  proportion  varied  with  covariate  values  and  treatment  groups 
Censoring  rates:  30% 

Aio  = —3.9  and  An  = —4.2 
A20  = —0.45  and  A21  = —0.4 
• A30  = —0.165  and  A31  = —0.18 


• /?o  = -0.2,  13,  = 0.15 
Setting  5: 

• Two  treatment  groups 
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• Three  intervals:  [0,4),  [4,10),  and  [10,  oo) 

• One  covariate  with  value  0 or  1 

• Cure  proportion  varied  with  covariate  values  and  treatment  groups 

• Censoring  rates:  30% 

• Aio  = —3.5  and  An  = —4.2 

• A20  = —0.45  and  A21  = —0.4 

• A30  = —0.15  and  A31  = —0.18 


• po  = -0.2,  A = 0.15 

The  survival  curves  for  the  above  five  parameter  settings  are  shown  in 
Figures  (5-1)  through  (5-5).  The  survival  probabilities  are  calculated  by  applying 


S(«)  = 


+ 


p^ig^ 


m=\ 


A 


mg 


A 


*5 


)}. 


(5.1) 


for  t e [rj_i,ri).  There  are  two  separate  survival  curves  in  the  first  two  figures  for 
individuals  with  covariate  values  of  0 and  1.  For  Figures  (5-3),  (5-4),  and  (5-5),  we 
draw  four  survival  curves  for  all  four  possible  combinations  of  the  covariate  and  the 
treatment  group  indicator. 


5.2.2  Generation  of  Censored  Observations 

From  the  assumption,  the  sample  of  censored  observations  follows  a uniform 
(0,  M)  distribution.  Therefore,  we  can  obtain  a sample  with  a uniform  (0, 1) 
distribution  and  then  transform  them  into  a sample  that  follows  a uniform  (0,  M) 
distribution.  The  primary  task  is  to  obtain  an  appropriate  value  for  M.  Let  U 
and  T denote  random  variables  for  censored  and  failure  observations,  respectively. 
By  taking  parameter  setting  1 as  an  example,  we  illustrate  the  derivation  of  the 
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Figure  5-1.  Survival  curves  for  patients  with  parameter  setting  1. 
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Figure  5-2.  Survival  curves  for  patients  with  parameter  setting  2. 
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Figure  5-3.  Survival  curves  for  patients  with  parameter  setting  3. 
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Figure  5-4.  Survival  curves  for  patients  with  parameter  setting  4. 
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Figure  5-5.  Survival  curves  for  patients  with  parameter  setting  5. 
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censoring  proportion  P{U  < T)  as  follows. 


1 e<?0+<3lJ^ 


Aij  \ > 


dc, 


for  M E [0,  3); 


P{U  < T) 


1 _ p-^ljC 

^ 


A 


19 


1 1 

+ w exple^o+'^i* — 

iVl  Mg  J3 

for  M 6 [3,  7);  and  for  M G [7, 00), 


pM 

} J exp{< 


,/3o+;3ix ' 


o^2g^  


X 


2s 


-}dc, 


P{U  < T) 


1 /'3  1 _ pAigC 

li  I exp{e'>»+'>--j^}dc 


+ 

M 


1 pAit;3  pA2o3  PA23C 

p{g/?o+/3ix — ^ j.  / exp{e^“'''^^^ ^ }dc 

^19  Jz 


1 1 /3A103  o^2gZ  _ oA2o7 

M ‘- 


A 

-M 
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A 
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A 
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/M 

exp{( 


,/^o-\-(3ix 


gAsg7  — gAspC 


A: 
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}dc. 


Calculation  of  integrals  in  the  above  equations  are  carried  out  approximately 
using  the  Simpson’s  Formula.  The  detail  of  the  Simpson’s  Formula  is  provided 
in  Appendix  J.  Therefore,  we  search  for  an  appropriate  value  for  M such  that 
P{U  < T)  reaches  the  required  censoring  proportion. 


5.2.3  Generation  of  Failure  Observations 

In  order  to  generate  a sample  of  failure  observations,  we  need  to  generate 
a sample  of  uniform  (0, 1)  observations.  Then  we  can  treat  the  value  from  the 
uniform  (0, 1)  as  the  survival  probability  for  the  failure  time  T.  By  inverting  the 
survival  function  defined  in  Equation  (5.1),  we  obtain  the  value  for  the  failure  time. 
Note  that  special  attention  is  required  at  the  inversion  because  we  are  dealing  with 


57 


piecewise  distribution.  Examples  are  given  for  observations  with  covariate  a;  = 0 in 
the  first  parameter  setting. 

From  Figure  (5-1),  we  observe  that  each  survival  curve  consists  of  three 
pieces  based  on  the  three  intervals  we  consider  in  the  study.  Therefore,  the 
inversion  function  for  generating  the  failure  time  observation  depends  on  the  value 
of  the  generated  uniform  random  number  U*.  The  survival  probability  when  t = 0 
is  1,  the  survival  probability 

1 ^—4.2x3 

P{T  > 3)  = exp{e“°-^ — — } = 0.81, 


and 


P{T  > 7)  = exp{e 


-0.1 


1 — e 


-4.2x3  ^-0.52x3  ^-0.52x7 


-4.2 


+ 


— e 


-0.52 


-]}  = 0.59. 


Therefore  if  u*  G [0.81, 1],  then 

T = ^ log[e^i»"''>  - log(i/*)]; 


Is 


if  u*  G [0.59,0.81),  then 


= logj  - X^g  [e  log(it*) 
Mg 


^AlgTo  p^lgTl 


X 


Is 


and  if  u*  G [0,  0.59),  then 


- Aas  [e-^“  log(u*)  - {- + 


A,; 


Ag 


Hg  '^2g 

Note  that  it  is  necessary  that  there  is  at  least  one  failure  observation  in  each 
interval  for  each  treatment  group  in  order  to  make  inferences  on  all  parameters. 


5.3  Type  I Error  and  Power 

Type  I errors  and  powers  of  tests  are  calculated  based  on  the  piecewise 
Gompertz  model  and  on  the  exponential-logistic  mixture  model  for  all  five 
sets  of  parameters.  The  test  statistic  based  on  the  piecewise  Gompertz  model 
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proposed  in  this  dissertation,  as  we  recall,  was  presented  in  Equation  (2.6).  For  the 
exponential-logistic  mixture  model,  we  assume  that  there  is  one  covariate  x in  the 
model.  The  time  to  failure  is  assumed  to  follow  an  exponential  distribution  with 
mean  1/A,  where  A is  modelled  as  a function  of  the  covariate  x and  the  treatment 
indicator  g.  The  parameter  A is  presented  as  exp(7o  + 7ix  -I-  72^)-  The  cure  rate 
P{Z  = 0)  is  defined  as  i_^^0o+\x+$29  • Therefore,  for  a fixed  covariate  value,  the  test 
of  the  equality  of  cure  rates  between  two  treatment  groups  is  equivalent  to  the  test 
of  JTo  : /^2  = 0.  Hence,  the  standardized  test  statistic  for  this  null  hypothesis  is 


Type  I errors  based  on  the  first  two  parameter  settings  for  both  models 
are  shown  in  Table  (5-1).  Powers  for  tests  based  on  parameter  settings  3 through 
5 for  both  models  are  presented  in  Table  (5-2).  All  tests  are  carried  out  for 
10000  generated  random  samples  and  for  three  different  sample  sizes:  400, 

1000,  and  2000.  The  numbers  of  unobtainable  test  statistics  (#  NaNs)  for  the 
exponential-logistic  mixture  model  are  shown  in  the  last  column  of  each  table.  The 
unobtainable  test  statistics  occur  because  the  likelihood  function  for  the  mixture 
model  is  close  to  flat  at  the  point  that  MLEs  are  obtained.  When  the  sample 
size  becomes  larger,  however,  unobtainable  or  unidentifiable  test  statistics  are 
eliminated. 

We  present  bar  plots  of  all  standardized  test  statistics  obtained  for  all 
three  sample  sizes  based  on  those  five  parameter  settings.  These  plots  help  to 
visualize  the  advantages  of  the  piecewise  Gompertz  model  in  terms  of  the  normal 
approximation  of  the  test  under  the  null  hypothesis  and  the  power  of  the  test  under 
the  alternative  hypothesis.  Figures  (5-6),  (5-7),  and  (5-8)  are  the  distributions  of 
the  test  statistics  under  parameter  setting  1 for  both  the  mixture  model  and  the 
piecewise  Gompertz  model.  These  three  figures  use  sample  sizes  of  400,  1000,  and 


TS  = 
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Table  5-1.  Type  I Errors  for  the  Test  of  Equality  of  Cure  Proportions  for  the  Mix- 
ture and  the  Three-Piece  Gompertz  Models. 


Parameter  Setting 

Sample  Size 

Type  I Error 

# NaNs 

Mixture 

Gompertz 

Mixture  Gompertz 

1 

(200,  200) 

0.0305 

0.0375 

220 

(500,  500) 

0.0171 

0.0412 

1 

(1000,  1000) 

0.0365 

0.0412 

2 

(200,  200) 

0.0203 

0.0373 

69 

(500,  500) 

0.0356 

0.0360 

(1000,  1000) 

0.0402 

0.0408 

Table  5-2.  Powers  for  Tests  of  Equality  of  Cure  Proportions  for  the  Mixture  and 
the  Three-Piece  Gompertz  Models. 


Parameter  Setting  Sample  Size 

Power 

# NaNs 

Mixture 

Gompertz 

Mixture  Gompertz 

3 

(200,  200) 

0.0260 

0.1016 

127 

(500,  500) 

0.0328 

0.2024 

(1000,  1000) 

0.0679 

0.3703 

4 

(200,  200) 

0.0616 

0.1896 

5 

(500,  500) 

0.0357 

0.4351 

(1000,  1000) 

0.2721 

0.7379 

5 

(200,  200) 

0.2706 

0.5572 

28 

(500,  500) 

0.7180 

0.9331 

(1000,  1000) 

0.9601 

0.9981 
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2000,  respectively.  Similarly,  Figures  (5-9),  (5-10),  and  (5-11)  are  for  parameter 
setting  2;  Figures  (5-12),  (5-13),  and  (5-14)  are  for  parameter  setting  3;  Figures 
(5-15),  (5-16),  and  (5-17)  are  for  parameter  setting  4;  and  Figures  (5-18),  (5-19), 
and  (5-20)  are  for  parameter  setting  5. 

From  these  tables  and  plots,  we  observe  that  Type  I errors  obtained  from 
the  three-piece  Gompertz  model  are  closer  to  0.05  than  those  from  the  mixture 
model.  Powers  presented  in  Table  (5-2)  for  the  three-piece  Gompertz  model  are 
much  larger  than  ones  for  the  mixture  model  for  all  sample  sizes.  The  estimation 
procedure  for  obtaining  MLEs  is  also  easier  to  carry  out  for  the  three-piece 
Gompertz  model  than  for  the  exponential-logistic  model.  In  addition,  the  algorithm 
for  the  three-piece  Gompertz  model  converges  much  faster. 
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Figure  5-6.  Standardized  test  statistics  for  parameter  setting  1 for  10000  random 
samples  with  Size  400.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-7.  Standardized  test  statistics  for  parameter  setting  1 for  10000  random 
samples  with  size  1000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-8.  Standardized  test  statistics  for  parameter  setting  1 for  10000  random 
samples  with  size  2000.  a)  Mixture  Model;  b)  Three-piece  Gompertz  model. 
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Figure  5-9.  Standardized  test  statistics  for  parameter  setting  2 for  10000  random 
samples  with  size  400.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-10.  Standardized  test  statistics  for  parameter  setting  2 for  10000  random 
samples  with  size  1000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-11.  Standardized  test  statistics  for  parameter  setting  2 for  10000  random 
samples  with  size  2000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-12.  Standardized  test  statistics  for  parameter  setting  3 for  10000  random 
samples  with  size  400.  a)  Mixture  model;  b)  Three-Piece  Gompertz  model. 
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Figure  5-13.  Standardized  test  statistics  for  parameter  setting  3 for  10000  random 
samples  with  size  1000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-14.  Standardized  test  statistics  for  parameter  setting  3 for  10000  random 
samples  with  size  2000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-15.  Standardized  test  statistics  for  parameter  setting  4 for  10000  random 
samples  with  size  400.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-16.  Standardized  test  statistics  for  parameter  setting  4 for  10000  random 
samples  with  size  1000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-17.  Standardized  test  statistics  for  parameter  setting  4 for  10000  random 
samples  with  size  2000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-18.  Standardized  test  statistics  for  parameter  setting  5 for  10000  random 
samples  with  size  400.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 
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Figure  5-19.  Standardized  test  statistics  for  parameter  setting  5 for  10000  random 
samples  with  size  1000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 


75 


a 


-8  -6  -4  -2  0 2 4 

Standardized  Test  Statistic 


b 


-8  -6  -4  -2  0 2 4 

Standardized  Test  Statistic 


Figure  5-20.  Standardized  test  statistics  for  parameter  setting  5 for  10000  random 
samples  with  size  2000.  a)  Mixture  model;  b)  Three-piece  Gompertz  model. 


CHAPTER  6 
EXAMPLE 

6.1  Introduction 

In  Chapter  2 we  proposed  a test  statistic  that  addresses  the  hypothesis 
presented  in  Equation  (2.3).  Subsequently  in  Chapter  3 and  Chapter  4 we 
established  the  asymptotic  normality  of  parameter  estimates  in  the  piecewise 
Gompertz  model.  The  asymptotic  normality  validated  the  test  of  hypothesis 
proposed  in  Chapter  2.  Chapter  5 reported  Monte  Carlo  Type  I error  and  power 
calculations,  which  established  the  strength  of  our  test  statistic  on  the  comparison 
of  cure  rates  under  various  parameter  settings.  In  this  chapter,  we  demonstrate  the 
applicability  of  our  test  statistic  with  a data  set. 

The  data  set  comes  from  a study  by  the  Pediatric  Oncology  Group  (POG) 
at  the  university  of  Florida  (Ravindranath  et  al.  1996).  The  motivation  of  this 
study  is  to  compare  cure  proportions  among  patients  with  acute  myeloid  leukemia 
(AML)  under  different  treatments.  Three  treatments,  allogeneic  bone  marrow 
transplantation  (alio.  BMT),  autologous  bone  marrow  transplantation  (auto. 

BMT),  and  intensive  chemotherapy,  are  performed  in  the  study.  In  order  to  register 
in  the  study,  all  patients  are  required  to  be  in  remission  after  the  first  two  courses 
of  induced  chemotherapy.  The  POG  evaluated  666  patients  whose  ages  were  under 
21  from  June  1988  to  March  1993.  A total  of  552  patients  entered  remission  and 
are  therefore  eligible  for  the  study.  Of  the  eligible  patients,  89  patients  received 
alio.  BMT;  117  were  assigned  to  an  additional  six-course  intensive  chemotherapy; 
and  115  were  assigned  to  receive  auto.  BMT.  There  were  231  patients  who  did  not 
participate  in  the  study  due  to  various  reasons. 
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6.2  Methodology 

Our  objective  based  on  this  data  set  is  to  perform  pairwise  comparisons 
on  cure  rates  between  treatment  groups  after  controlling  for  patient’s  gender.  We 
compare  tests  for  three  parametric  models  and  for  a proportion  test  based  on  KM 
estimates  of  cure.  The  three  parametric  models  are  the  logistic-exponential  mixture 
model,  the  one-piece  Gompertz  model,  and  the  three-piece  Gompertz  model.  In 
addition,  we  present  cure  rates  and  their  standard  deviations  for  every  treatment 
and  gender  combination  based  on  estimates  from  parametric  models  and  on  KM 
estimates.  In  the  end,  we  compare  the  standard  one-piece  Gompertz  model  with 
the  three-piece  Gompertz  model  by  applying  the  goodness-of-fit  test  we  derived 
in  Ghapter  5.  In  this  section,  we  will  briefly  describe  1)  three  parametric  models 
and  their  corresponding  test  statistics;  and  2)  the  proportion  test  based  on  KM 
estimates  of  cure  proportions. 

In  the  exponential-logistic  mixture  model,  we  assume  an  exponential 
distribution  with  parameter  A for  the  failure  time  and  a logistic  regression 
model  for  the  cure  proportion.  The  expression  log(A)  is  defined  as  a linear 
function  of  the  covariate  x and  the  treatment  indicator.  We  define  two  dummy 
variables  to  represent  the  three-level  treatment  indicator.  They  are  presented  as 

Gi  — /[if  patient  in  treatment  1]  &nd  G2  — -f[if  patient  in  treatment  2]-  The  SUrvival  function 

for  the  failure  time  given  that  the  observation  is  not  a long-term  survivor  (Z  = 1)  is 
S*{t\Z  = 1)  = (6.1) 

The  cure  probability  is  defined  as 

~ g/3o+/3iXi+/923i+/33S2  ' 


(6.2) 
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Therefore  the  hypothesis  of  testing  the  equality  of  cure  rates  in  three  treatment 
groups  can  be  written  as 

-ffo  ■ ^2  = 03  = 0.  (6-3) 

Pairwise  comparisons  between  two  treatments  are  based  on  the  following  standard- 
ized test  statistics: 


Ti  = 

P2  — 03 

(6.4) 

\Iv02-03) 

T2  = 

. , and 

vm) 

(6.5) 

Ts  = 

03 

(6.6) 

\/V{03) 

Note  that  Ti  is  the  test  statistic  that  compares  cure  proportions  between 
Treatments  1 and  2;  T2  compares  cure  proportions  between  Treatments  1 and 
3;  and  T3  compares  cure  proportions  between  Treatments  2 and  3.  The  variances 
are  obtained  by  inverting  the  observed  information  matrix  obtained  by  taking  the 
second  derivatives  of  the  log-likelihood  function  with  respect  to  parameters. 

The  one-piece  Gompertz  model  is  a special  case  of  the  piecewise  Gompertz 
model.  Let  Xg  be  the  parameter  associated  with  the  treatment  g and  /?i  the 
coefficient  associated  with  the  covariate  x.  For  Xg  < 0,  the  cure  proportion  for 
patients  with  the  treatment  g and  the  covariate  x is  written  as 

’Kg{x)  = (6.7) 

If  Xg  < 0,  there  is  cure  proportion  for  patients  under  the  treatment.  If  Ag  < 0 
for  any  g,  then  the  hypothesis  of  testing  the  equality  of  cure  proportions  is  written 
as 


Ho  ■ Ai  — A2  — A3. 


(6.8) 
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Standardized  test  statistics  for  pairwise  comparisons  of  cure  proportions  are 
presented  as 

Ai  — A2 


Ti 


T,  = 


X\  — A3 
^1/(Ai-A3) 
A2  — A3 


, and 


(6.9) 

(6.10) 

(6.11) 


y/v{X2  - A3) 

Similarly  these  three  standardized  test  statistics  are  for  the  comparison  of  cure 
proportions  between  Treatments  1 and  2,  between  Treatments  1 and  3,  and 
between  Treatments  2 and  3,  respectively.  The  variances  are  obtained  from  the 
inverse  of  the  observed  information  matrix. 

Finally,  as  we  have  discussed  in  Chapter  2,  the  cure  proportion  in  a 
piecewise  Gompertz  model  as  a function  of  the  covariate  x and  the  treatment  g 
is  expressed  in  Equation  (2.4).  We  consider  a three-interval  case  in  this  example. 
Denote  the  three  intervals  by  [ro,ri),  [ri,T2),  and  [r2,r3),  where  ro  = 0 and  T3  = 00. 
When  Xsg  < 0 for  any  g,  testing  the  equality  of  cure  proportions  for  any  given  value 
of  X is  equivalent  to  testing 


Ho  : Yo 


— 1 — 1 ^ — 1 


A. 


il 


E 


A,: 


i2 


= E 


A 


i3 


(6.12) 


i=l  i=l  i—1 

Standardized  test  statistics  for  pairwise  comparisons  between  treatment  groups  are 
presented  as  follows: 


Ti  = 


To  = 


i=l 


Xi2 


(6.13) 


^(ELi  - 

y-3 


’’i 


Ai 


Ail  * ^ Ai2 

E3  e^‘3 

i=l 


Ai3 


and 


^(^3  g-^tiU-i  _e-^ii’"i  _ y-^3 
* ^ Ail 


^i3’"i-l  _gAi3T. 


Ai3 


4 


(6.14) 
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Y^3 

2-^iz=:\ 


2^i— 1 _eAj2^i 

Ai2 


- 

Z^i=l 


^i3'^i-l -e^iSn 
-^i3 


(6.15) 


e^i2n-i —g>^i2'^i 

*'  vZ^i=i  r:  z^i=i 


3 e^i3’-i-i_eAi3Ti 


) 


•^i2  * ^ A,3 

These  test  statistics  Ti,T2,  and  T3  are  for  comparisons  of  cure  proportions  between 
Treatments  1 and  2,  between  Treatments  1 and  3,  and  between  Treatments  2 and  3, 
respectively.  Variance  estimates  are  not  quite  straightforward  as  for  tests  based  on 
the  mixture  model  and  on  the  one-piece  Gompertz  model.  They  have  already  been 
presented  in  Chapter  2 and  will  not  be  repeated  here. 


Proportion  tests  of  KM  estimates  of  cure  for  pairwise  comparisons  between 
treatments  are  generated  based  on  KM  estimates  of  cure  proportions  and  standard 
errors  of  the  estimated  cure  proportions. 


P-values  for  tests  derived  from  parametric  models  and  for  the  proportion 
test  will  be  presented  in  the  next  section.  In  addition,  we  present  survival  curves 
estimated  based  on  both  parametric  models  and  KM  estimates.  Furthermore, 
we  present  the  cure  proportion  and  the  corresponding  standard  deviation  for 
each  treatment  and  gender  combination  based  on  parametric  estimates  and  on 
KM  estimates.  The  result  of  comparing  the  one-piece  Gompertz  model  with  the 
three-piece  model  by  the  goodness-of-fit  test  we  derived  earlier  is  also  presented. 


6.3  Results 

6.3.1  Pairwise  Comparisons  on  Cure  Proportions 

In  Table  (6-1),  we  present  p-values  for  tests  based  on  three  parametric 
models  and  for  the  proportion  test  based  on  KM  estimates.  The  three  intervals 
for  the  three-piece  Gompertz  model  are  defined  as  [0, 50),  [50, 140),  and  [140, 00). 
All  numbers  presented  here  are  in  terms  of  weeks  in  follow-up.  Tests  based  on  the 
exponential-logistic  mixture  model,  the  three-piece  Gompertz  model,  and  the  KM 
estimates  detect  the  difference  in  cure  proportions  between  alio.  BMT  and  auto. 
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BMT  and  the  difference  between  alio.  BMT  and  intensive  chemotherapy.  The  alio. 
BMT  increases  cure  proportion  significantly  more  than  the  other  two  treatments. 
There  is  no  significant  difference  between  auto.  BMT  and  chemotherapy.  A test 
based  on  the  one-piece  Gompertz  model  does  not  show  any  significant  difference 
between  any  two  of  these  three  treatments.  The  maximization  procedures  obtaining 
MLEs  of  parameters  for  both  the  one-piece  Gompertz  model  and  the  three- 
piece  Gompertz  model  are  smooth.  The  algorithm  converges  within  a few  steps. 
However,  there  is  a convergence  problem  related  to  the  mixture  model.  Different 
starting  values  for  parameters  are  chosen  in  order  to  obtain  the  convergence.  We 
have  discussed  the  convergence  problem  related  to  the  mixture  model  in  both 
Chapter  1 and  Chapter  5.  The  convergence  problem  we  encounter  here  confirms 
our  earlier  assertion. 


Table  6-1.  Comparison  of  Tests  from  the  Mixture  Model,  the  One-Piece  Gompertz 
Model,  and  the  Three-Piece  Gompertz  Model,  and  the  Proportion  Test  on  Cure 
Proportions. 


Comparisons 

P-values  for  Treatment  Comparisons 

Mixture 

1-Piece 

Gompertz 

3- Piece 
Gompertz 

Proportion 

Test 

Alio.  BMT  vs  Auto.  BMT 

0.016 

0.384 

0.015 

0.011 

Alio.  BMT  vs  chemotherapy 

0.012 

0.101 

0.003 

0.007 

Auto.  BMT  vs  chemotherapy 

0.690 

0.516 

0.897 

0.741 

In  Figures  (6-1)  to  (6-4),  we  show  estimated  survival  curves  from  different 
parametric  models  and  from  KM  estimates.  The  three  curves  on  Figure  (6-l)a  are 
survival  curves  for  three  treatment  groups  for  male  patients  based  on  the  mixture 
model.  Figure  (6-l)b  presents  three  treatment  survival  curves  for  female  patients 
based  on  the  mixture  model.  Figure  (6-2)  presents  survival  curves  estimated  from 
the  one-piece  Gompertz  model  for  male  and  female  patients  in  three  treatment 
groups.  Survival  curves  estimated  from  the  three-piece  Gompertz  model  are 


presented  in  Figure  (6-3).  The  KM  estimates  of  survival  curves  are  presented  in 
Figure  (6-4). 
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6.3.2  Cure  Proportion  Estimates 


Estimates  of  cure  proportions  are  presented  in  Table  (6-2).  We  present 
estimates  of  cure  rates  and  their  standard  deviations  from  the  mixture  model,  the 
one-piece  Gompertz  model,  the  three-piece  Gompertz  model,  and  the  KM  estimate 
for  all  combinations  of  patient’s  gender  and  three  treatment  groups.  Treatment 
1 represents  alio.  BMT,  treatment  2 represents  auto.  BMT,  and  Treatment  3 
represents  intensive  chemotherapy.  Note  that  cure  proportion  exists  for  each  of  the 
combinations.  Female  patients  show  higher  cure  probability  than  male  patients. 
Standard  errors  for  cure  proportions  estimated  from  both  the  three-piece  Gompertz 
model  and  the  mixture  model  are  smaller  compared  to  the  one-piece  Gompertz 
model  and  the  KM  estimates. 

Table  6-2.  Estimates  and  Standard  Deviations  of  Cure  Proportion  Generated  from 
the  Mixture  Model,  the  One-Piece  Gompertz  Model,  the  Three-Piece  Gompertz 
Model,  and  the  Kaplan-Meier  Estimates. 


Cure  Proportions  (Standard  Deviations) 
1-piece  3-piece 


Gender  Treatment  Mixture  Gompertz  Gompertz  KM 


M 

1 

0.558 

(0.059) 

0.504 

(0.062) 

0.568 

(0.056) 

0.648 

(0.076) 

M 

2 

0.386 

(0.050) 

0.445 

(0.060) 

0.397 

(0.050) 

0.387 

(0.065) 

M 

3 

0.407 

(0.033) 

0.409 

(0.050) 

0.405 

(0.033) 

0.388 

(0.036) 

F 

1 

0.597 

(0.056) 

0.533 

(0.062) 

0.596 

(0.054) 

0.531 

(0.071) 

F 

2 

0.424 

(0.051) 

0.475 

(0.061) 

0.429 

(0.051) 

0.421 

(0.065) 

F 

3 

0.445 

(0.036) 

0.440 

(0.051) 

0.437 

(0.035) 

0.466 

(0.041) 

6.3.3  Goodness-of-Fit 


From  the  above  discussion,  we  observe  that  the  three-piece  Gompertz  model 
fits  better  than  the  one-piece  Gompertz  model.  It  is  natural  to  compare  these  two 
models  by  applying  the  goodness-of-fit  test  we  derived  in  this  dissertation.  The  null 
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Figure  6-1.  Survival  curves  estimated  from  the  mixture  model  for  three  treatments, 
a)  Male;  b)  Female. 
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Figure  6-2.  Survival  curves  estimated  from  the  one-piece  Gompertz  model  for  three 
treatments,  a)  Male;  b)  Female. 
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Figure  6-3.  Survival  curves  estimated  from  the  three-piece  Gompertz  model  for 
three  treatments,  a)  Male;  b)  Female. 
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Figure  6-4.  Survival  curves  from  Kaplan-Meier  estimates  for  three  treatments,  a) 
Male;  b)  Female. 
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hypothesis  is  expressed  as 

Hq  : Alg  - \2g  = Xzg, 

for  any  g = 1,2,  and  3.  Note  that  \2g  and  \^g  are  parameters  for  the  second  and 
third  interval  in  the  three-piece  Gompertz  model,  respectively.  We  showed  that 
under  Hq 

-2{h  - h)  - Xi, 

where  and  I2  are  the  log-likelihood  functions  for  the  one-piece  and  the  three- 
piece  Gompertz  models,  respectively.  From  the  computation,  we  obtain  that 
li  = —1977.289  and  I2  = —1950.979.  Therefore,  the  test  is  highly  significant. 

It  implies  that  the  three-piece  model  is  significantly  better  than  the  one-piece 
Gompertz  model. 


CHAPTER  7 
SUMMARY 


In  this  dissertation,  we  presented  a piecewise  Gompertz  model  to  address 
long-term  survival  problem.  We  constructed  a standardized  test  statistic  based 
on  our  model  which  compares  cure  rates  between  treatment  groups.  We  derived 
the  strict  concavity  of  the  likelihood  function  based  on  the  model.  In  addition, 
we  derived  the  asymptotic  properties  of  the  MLEs  of  parameters  in  the  model.  A 
goodness-of-fit  test  was  also  derived  to  access  an  appropriate  number  of  intervals 
for  the  model. 

In  oncology  study  when  there  are  long-term  survivors  and  when  there  are 
different  hazard  patterns  on  different  time  intervals,  one  might  consider  using  our 
piecewise  Gompertz  model.  Our  model  is  more  computational  feasible  than  both 
the  mixture  model  and  the  semi-parametric  model  with  bounded  cumulative  hazard 
function.  In  addition,  our  model  is  more  flexible  due  to  its  piecewise  structure.  Our 
model  also  provides  a reliable  estimate  of  the  cure  rate  than  the  KM  estimate  when 
the  sample  size  is  small  at  the  end  of  the  study  period. 

In  the  future,  we  plan  to  find  an  optimal  partition  of  the  time  span  by 
deriving  a test  to  compare  different  partitions.  Sample  size  determination  will  also 
be  assessed.  Since  we  notice  that  there  are  jumps  on  the  survival  curve  defined 
by  the  piecewise  Gompertz  model,  we  might  consider  using  a log-spline  model  to 
achieve  smoothness  of  the  curve.  In  addition,  we  plan  to  perform  more  detailed 
simulation  studies  on  the  semiparametric  model  which  assumes  bounded  cumulative 
hazard  function  and  to  study  the  asymptotic  behavior  of  the  semiparametric  model. 
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APPENDIX  A 

DERIVATION  OF  MARGINAL  LIKELIHOOD 


Let 


and  define 


poo  poo 

JtQ=0  Jtn  — 1 j— j 


poo  poo 

^‘ = / • ■ 7 n /('ije""'''’®'*, 

Jtn-k  Jtn-l  i=n-k+l 


dtn—1 ' ■ ■ dtfi^k+ii  ^ 1, 2, . . . , n. 


As  /c  = 1, 


poo  n 

h=  - e”®"],  (A.l) 

Jtn-l 


and  when  k = 2, 


aOO 

n f{U)e~^^*''’^'dtndtn-i 

- - 1-1 i=n-l 

poo  poo 

J tn  — 2 "in— I 

poo  1 

Jtn-2 


1 


On(On-l  + On) 


Jg-(0n-l+©n)i^(tn-2)  _ g-(©n-l+0n)j 


-0n 


■[e' 


-0„_lF(tn-2)  p — ©n 


e-®-7. 


OnOn— 1 

By  similar  derivation,  we  obtain  that  when  k — 3, 

poo 

Ju-% 

j'g-(©n+©n-l+©n-2)F(t„_3)  _ g-(©n+©n-l+©n-2)j 


1 


On(©n  + O„_i)(0„  -I-  ©n-l  + ©n-2) 
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-On 


0n0n-l(©n-l  + 0n-2) 

g — (0n+0ri-l) 

0n-l©n-2(0n  + ©n-l) 
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g-0n-2^(<n-3)  _ g— 0n-2l 


Let 


/fc  = ^(-1)*  ^Ai^k[e  ^’^-k+hn-i+lF{tn-k)  _ g Sn- 


A;  + l,n— i+1 1 


(A.2) 


i=l 


then 


'fc+i 

fOO  ^ 
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/OO  ^ 

^ ^ ^ ^ — fc+l,n  — 1+1  ^(^n  — A:)  — fc+l,n  — i+1  j 

1 

^ ^ ^ ^g“*^n  — /e,n  — 1+1  ^(^n  — fc— 1 ) g~'^n  — fe,n— i+1  j 
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^ 1 
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©n— ' 


i=l 

Also  we  know  that 


k+l 


Mj,fc+l[e  ^n-k,n-i+lF(tn-k-l)  _ 5„_fc,„_i+i  | 


i=l 


= ^(-1)*  ^+,fc+l[e  ^^-k,n-i 
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i=l 
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(A.4) 


Equate  (A. 3)  and  (A.4),  we  obtain  that 


A, 


i,fc+l 
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^n—k,n—i+l 


and 
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Therefore  by  Equation  (A. 5) 


= A 


1 


^n—k+l,n—i+l^n—k,n—i+l 

1 


Q c 

>^n 


(A.7) 


^n—i,n—i+l^n—i—l,n—i+l  ‘ ' ' Sn—k+l,n—i+l  Pn—k+l,n—i+l 

for  i < A:  and  by  the  definition  of  Pa,b  given  in  Chapter  1.  In  addition  by  Equation 
(A.6), 


t=l 


Ak+i,k+i  = 

“7  ^n-k 
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= E(-i) 
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(A.8) 


Also  we  observe  that 


©n 

where  the  first  equality  is  true  by  Equation  (A. 2)  and  the  second  is  true  from 
Equation  (A.l).  Therefore 


^1, 


1 1 


*S'n,n  ©n 

Similarly  we  obtain  through  derived  Equation  (A.8) 

C p — Sn,n 
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where  P”  was  defined  earlier  in  Chapter  1 as  rij=n-fc+2  ^n-k+2,j-  In  addition, 
we  observe  from  Equation  (A. 8)  that 


fc-2 


1 '1 ^ C . , , ■ , ^P  — fc+2,n  — i+l 

/ pn-k+2,n—i+l 
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-}■ 


Sn-k+l,n-k+l  pn-i+2,npn-k+2,n-i+l 

Hence  from  equating  Equations  (A. 9)  and  (A. 10),  the  following  equation  holds. 
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Now  we  can  derive  Ajt+i,fc+i  as  follows: 
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by  Equation  (A.ll).  Therefore, 
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for  i < k.  Ik,  as  defined  in  Equation  (A. 2),  therefore  can  be  written  as 
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(A.13) 
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Set  k = n,  we  obtain 
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1=0 

by  Equation  (A.  11).  Hence  the  marginal  likelihood  function  defined  in  Equation 
(1.14)  can  be  derived  since  we  observe  that 

poo  poo 

L„=/  ■••/  L = n«f/„, 

Jto—0  Jtn-l  j_l 


where  L was  defined  in  Equation  (1.13). 


APPENDIX  B 
PROOF  OF  LEMMA  4.1 


The  results  presented  in  Lemma  4.1  hold  for  any  general  parametric  model  if 
it  satisfies  certain  regularity  conditions.  In  our  proof,  we  show  that  the  results  also 
hold  for  any  parametric  model  with  independent  censoring  under  certain  regularity 
conditions.  Let  T*  = min(T',  U)  be  the  observed  survival/censoring  time  and  A 
be  the  censoring  indicator.  Denote  the  hazard  function,  the  probability  density 
function,  and  the  survival  function  for  the  survival  time  T by  h{T-,0),  f{T-,6),  and 
5(T;0),  respectively.  Denote  the  survival  function  for  the  censoring  time  U by 
Gu{u).  Then  the  likelihood  function  is  written  as 

L{T*,A)  = h^{T*;0)S{T*-,d). 

Therefore  the  expectation  of  the  score  function  S{0)  has  the  following  expression: 


= E 


= E 


E[S{e)] 
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(B.l) 


We  can  write  the  first  term  on  the  right-hand  side  of  Equation  (B.l)  as 
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Similarly,  the  second  term  of  the  right-hand  side  of  Equation  (B.l)  can  be  written 


as 


^(h^(r-,e)s(r-,e)) 
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if 
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Hence, 


E[5(6>)]  = ^ [P{T  <U)  + P(T  > U)]  = 0. 

Now  we  need  to  prove  F[5(0)]  = D.  Let  and  jg{j§L)  represent  the 

(p  -f  IG)  X (p  q-  IG)  dimensional  matrices  formed  by  corresponding  second  order 
derivatives.  Note  that 
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It  is  clear  that  E[5(0)]  = D{6)  if  we  can  prove 
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Note  that  we  can  derive  E[j^^]  as 
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The  first  term  of  the  right-hand  side  of  Equation  (B.2)  can  be  written  as 
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Similarly,  the  second  term  of  the  right-hand  side  of  Equation  (B.2)  is  written  as 

poo  ^ poo 
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Jc  S{u-,e)  d$  Jo 
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The  combination  of  Equations  (B.3)  and  (B.4)  shows  that 

T fpT  fl2 


APPENDIX  C 
PROOF  OF  LEMMA  4.2 


We  will  prove  this  lemma  for  / = 1.  The  proof  for  / = 2 is  similar  and  is 
omitted.  Note  that 
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where  and  Gu{t)  are  survival  functions  of  the  survival  time  Tj  and  the 

censoring  time  Uj,  respectively.  Note  that 
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roo  i-t  , 

+ / exp{— / h{u-,X(i^K{j)^3{u))du]f^e^'9^dt.  (C.3) 

Jtj-i  Jo 

It  is  observed  that  the  first  term  in  Equation  (C.3)  is  uniformly  bounded  for  A 
satisfying  ||A'  — Ao||  < e.  The  second  term  in  Equation  (C.3)  can  be  written  as 
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since  ||xj||  < B for  any  j = 1, 2, . . . , n by  Assumption  1.  We  will  discuss  Equation 
(C.4)  in  the  following  three  cases; 

Case  1.  Xfg  < 0.  ||A'  — Ao||  < e implies  X'jg  < Xfg  + e < ^Xfg  < 0.  Therefore, 
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Case  2.  xfg  = 0.  The  following  inequality  holds. 
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Case  3.  xfg  > 0.  Equation  (C.4)  can  be  written 
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since  X'jg  is  uniformly  bounded.  Hence,  there  exists  a F > 0 such  that 

Eoo{ J A',K(j),  J(«))du|  < F, 

for  X'  satisfying  ||A'  — Aoll  < e. 


APPENDIX  D 
PROOF  OF  LEMMA  4.3 


Note  that  for  0 < 5 < oo, 


poo  pt 

- - [ u^h\u-,Xo,K,{j),3{u))du\d[Snj{t-,Oo)Gu{t)] 

Jo  Jo 

poo  pS 

> - [ u^h^{u;XQ,K{j),3{u))du]d[Snj{t-,9o)Gu{t)] 

Js  Jo 

pS  poo 

= - / u"^h^{u]Xo,K{j),3{u))du  / d[Snj{t]9o)Gu{t)] 

Jo  Js 

- [J  u'^h\u-,Xo,K{j),3{u))du][Snj{S;9o)Gu{S)  - Snj{oo-,9o)Gu{oo)]. 

We  observe  that 

Snj{5-,9o)Gu{S)  - Snj{oo-,9o)Gu{oo)  > Snj{S-,9o)[Gu{6)  - G'u(oo)] 

pS 

> exp{-e^ll^“M  h{u-,Xo,K{j),3{u))du}[Gu{S)-Gu{oo)] 

Jo 

— >■  1 — G[/(oo), 

as  (5  0 uniformly  for  j = 1, 2, . . . , n.  Therefore,  we  can  find  a > 0 and  an  e > 0 

such  that 

Snj{So-,9o)Gu{do)  - Snj{oo;9o)Gu{oc)  > e 


and 


pSo 

L 


vJ^h:{u\  Ao,  K{j),3{u))du  > e. 


Thus  the  expectation  term  in  Equation  (4.1)  > for  all  j = 1, 2, . . . , n.  If  we  take 
7 = e^,  the  lemma  is  proved. 
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APPENDIX  E 
PROOF  OF  LEMMA  4.4 


Let  = (a^,  b^)  be  a (p  + /G) -dimensional  unit  vector,  where  a is 
p-dimensional  and  b^  = {bu, . . . , bn, . . . , bic,  ■ ■ ■ , bjoY  is  /G-dimensional.  It 
suffices  to  show  that 

c^DnC  > wy, 

for  any  unit  vector  c.  We  showed  in  Chapter  3 that 

YDjiC  = / h{u;  A,  /c(j),  J(u))(a^Xj  + u\YQ,{u,  j))^du. 

j=\  •'o 

By  expanding  the  expectation  in  the  above  equation,  we  obtain 


c^D„c 

^ noo  f ft  o') 

= ly  h(u; A,/c(j), J(u))(a'^Xj  + ub^Q(u, j))  du|dFr;(t) 

= h{u-,X,K{j),3{u)){a^yij + u\YQ{u,j)Ydu'^d[Sj{t)Gu{t)]. 


Since  l|x,ll  < B by  Assumption  1, 


C^DnC 

^ fOO  ft 

> / / h{u-,X,K{j),J{u)){a^Xj  + u)YCl{u,j))  du  d[Sj{t)Gu{t)] 

Jo  Jo 

G poo  ft 

= Y2  - / h{u-,X,K{j),J{u)){a^y:j + u\YQ{u,j))  du 

9=1  jegroup  g ^ ^ 

d[SYt)Gu{t)]. 

(E.l) 
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Assume  that  the  j'"  individual  belongs  to  the  treatment  group  g,  then 

/ 

ub  Q('U,j)  ==  ^ ^ ^ig^ue[Ti-i,Tj)] 


i=l 


and 


i=l 


Note  that  there  exist  a go  where  I < go  < G and  a io  where  I < io  G such  that 


*090  — JQ 


1 


IG 


(E.2) 


We  will  discuss  the  behavior  of  the  right-hand  side  of  Equation  (E.l)  in  the 
following  four  cases. 


Case  1.  ||a|p  < 2(8B^g/+t^)  ^ treatment 


group  ^0, 


poo  pi 

- / h{u;X,K{j),J{u))[a^yij + uh^Q,{u,j)]^dud[Sj{t)Gu{t)] 

Jo  Jo 

poo  pT\ 

> - / h{u-,X,K{j),3{u))[a^Xj+uh^Q{u,j)Ydud[Sj{t)Gu{t)]. 

J Tt  J hr-i 


Because  u > |ti,  ||x||  < B,  and  ^ , we  observe  that 

[a^Xj  +ub^Q(u,  j)]2  > hub^CliuJ)]'^  - (a^Xj) 


1 


1 


> -^(1- llain-B^llalp  > 

- 8IG^  " " ’ " " - 16/G 


Hence, 


> 


poo  pt 

- / h{u]  X,K{j),J{u))[a^Xj  + uh^Q{u,j)fdu  d[Sj{t)Gu{t)] 

Jo  Jo 

2 poo  pTi 

~T^lJr  lj,^^''^’^''du]d[Sj{t)Gu{t)] 

- e5»-*.")[S,(rOGt;(Ti)  - S,(oo)G„(c»)l 
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> 


> 


16G/  AigQ 


1 ^ ^g'^lSo'’’!  _ exp{— f e^^®o“(iu}[G{/(ri)  — Gt/(oo)] 

GI  Algo  Jq 


16GI  Algp 
7i  > 0, 


by  Assumption  4. 

2 

Case  2.  ||a||^  < 2{8b^gi+t^)  ^ treatment  group  go, 

-IM  h{u]  A,k(j),  J(u))[a^Xj  +ub^Q,{u,j)]^du^d[Sj{t)Gu{t)] 

- {/  + uh^Q{fJ'J)?du^d[Sj{t)Gu{t)] 

Jti-I  ''Jtj-1 

roo  . pTi-i+S  ^ 

> - { h{u-,X,K{j),J{u))[a^:x.j + uh^Cl{u,j)]^du\d[Sj{t)Gu{t)]. 

Jtj-i+S 

Since  u > r/_i  > |ti  in  the  above  expression,  with  the  additional  conditions  that 


i-INI 
igo  — IG 


and  ||xj||  < B for  any  j = 1,  2, . . . , n,  we  obtain  that 


[a^Xj + ub^Q(u,j)]  > 


2 \ '’'i 


16IG 


Therefore, 


/>00  pTj-i+S  ■. 

- {/  h{u-,X,K{j),J{u))[a^yij + uh^Q{u,j)fdu\d[Sj{t)Gu{t)] 

i_(e^/9oC^-i+<^)  - e^’^o'^'-^)[Sj{Ti-i  + 6)Gu{ti-i  + 5)  - 5j(oo)G[/(oo)] 


> 


> 


16G7  A/gp 


+ 6)[Gu{t,-,  + S)  - G„(oo)|. 

iOGi  A/go 


(E.3) 


Note  that 


Sj(r/_i  + S)[Gu{ti-i  + (5)  — G£/(oo)] 

Sj(Tl-l)[Gu{Tl-l)  — Gj/(oo)] 

> exp{-e®ll^ll  / /i(u;  A, /c(j),  J(u))dn}[G[;(r/_i)  - G[/(oo)]  > e > 0, 
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where  Assumption  4 is  applied.  Therefore,  there  exists  a 5 > 0 such  that 

- J ly  /i(u;  A,  K(j),  J(u))[a^Xj  + uh^ Q{u,  j)]‘^du'^d[Sj{t)Gu{t)]  > 72  > 0. 

2 

Case  3.  |la|p  < 2{s b^gi+t^)  I <io  < I-  Note  that  for  u G [Ti^-\,Ti^)  and 

for  each  j in  treatment  group  go,  the  following  inequality  also  holds: 

^2 


[a^x,+nb^Q(u,;)f  >^. 


Therefore, 


-/  [J  h{u-,X,K{j),J{u))[a^y.j + uh^Q{u,j)fdu^d[Sj{t)Gu{t)] 

poo  pTig 

> — / h{u;X,K{j),J{u))[a^Xj  + uh^Q{u,j)]‘^dud[Sj{t)Gu{t)] 

Jn^  JriQ-i 


> 


> 


> 


16GI 

1 

16G/ 


e^'o9o^du[Sj{Tio)Gu{Tio)  - 5j(oo)Gt;(oo)] 


T-f  1 

16G/  Aj(jgo 

[Gu{Tio)  ~ G'f/(oo)] 


.(e^iosoUo  _ eNo9o^io-i)5'^.(riJ[Gt/(riJ  - G[/(oo)] 

I 

(e'^»o9o'^‘o  — 6^090 ■^•0-1)  / h{w,X,K{j),J{u))du} 

Jo 


= 73  > 0, 


where  Assumption  4 is  used  again. 

By  combining  the  above  three  cases,  we  obtain  that 

c^DnC  > . 

where  j'  = min(7i,  72, 73)  and  Ug^  is  the  number  of  individuals  in  group  ^o-  Under 
Assumption  3 the  following  holds: 

• r 

mm  — > di, 
l<j<n  n 
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when  n is  sufficiently  large.  Therefore, 

c^DnC  > = n'y 


where  7 = 5iVe 


Case  4.  ||a|r  > 


2{SB'^GI+Tf) 


say 


el 


Note  that 


|ub^Q(«,  j)|  < \u\GI. 


Therefore, 

(a^x,  + !ib''’Q{ti,  j))^  > l(a’’x,)^  - [tib^Q(a,  j)]^ 


By  Assumption  2,  there  exist  an  e > 0,  a 5 > 0,  and  a no  > 0.  When  n > no,  there 
exists  a subset  A of  {1,2,  ...,n}  with  size  ua  > nS  such  that 

T 

I AtXjI  > e for  j € A.  (E-4) 

HI  '' 


Now  we  assume  n > uq  and  (E.4)  is  satisfied.  Therefore,  if  we  denote  by  Si,  for 
u e (0,5i], 


[a^Xj  + nb^Q(u,  j)]^  > -(a^Xj)^  — 


> 

1 


2,2 


-€  e 


Hence, 


C^DnC 

A(«;A.K(i),J(a))^/K(o.,,ll<i«}<i[Sj(t)G„(t)]. 


Without  loss  of  generality,  let  us  assume  tii  < Ti.  For  the  individual  in 
treatment  group  g, 
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/OO  -2  2 

[ J h{u]  K{j),  J{u))—^I[ue{0,5i]]du\d[Sj{t)Gu{t)] 

poo  pSi  2,2 

> -y  [J  h{u]X,K{j),3{u))^du]d[Sj{t)Gu{t)] 

> - 5,(oo)Gc,(oo)]  > 7^  > 0, 

where  Assumption  4 is  used.  Note  that  7^  depends  on  g.  If  we  take 


7o  = min  7„  > 0 and 
1<5<G  « 


denote  ^706  by  7,  then 


c^DnC  > e > e = ’^7- 


APPENDIX  F 
PROOF  OF  LEMMA  4.5 


From  Lemma  4.1,  we  observe  that 
E[c^D-^l^[Fr,{e,)  - = c^D-^l\Dn  - Dr;\D-^l\  = 0. 


In  addition  we  have 
,=1  -^0 


-1/2 

n 


J=1 

DZ^^'^cdu. 


\ 


uQ{u,j) 


xj  uQ'^iuJ) 


1 /rt 

We  showed  in  Lemma  4.4  that  Amin(D„)  > ri'y.  This  implies  that  Xmax{Dn  ' ) < 
-F=.  Therefore, 

y/fpy  ’ 


< 


( ^ 

^uQ{u,j)  ^ 

(xj  uQ 

\ 

Y • 

yxj  uQ^{u,j)  j 

uQ{u,j)  ^ 

( 

\ 

D-'l^c  = 

/ 

/ 

(F.l) 


Since  T*,  j = 1, 2, . . . , n are  independent. 


J=1 


/ \ 

pT* 

1 / ' /i(u;  Ao,K(j), 

X, 

'■A 

(xj  uQ^{u,j)) 
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/ 


)-l/2 


\ 


^uQ{u,j)  ^ 


< [ /i(u;  Ao,K(j),  J(m))c^D„ 

j=i  Jo 

D-‘/’cd«}' 

< f ' /i(u;Ao,KO),J(ti))— (B^  + /Gti^)dti}^ 

maxE'l  / u’^/i(u;  Ao,  K(j),  J(u))(iw  > 

^m=0,2  1^0  J 


(xj  uQ^(«,i)) 


< e2B||j3o|| 


(B2  + iGf 


n^'f- 


< e2B||/3o|| 

= 0(n->). 


(B^  + IG)'- 


r)2^2 


nT 


The  last  inequality  holds  by  Lemma  4.2  and  F is  a constant  depending  on  Aq. 


Therefore,  for  any  e > 0, 

B(|c’’£);‘/"(E„(e„)  - > e)  < ^ ^ 0, 


as  n ^ 00  by  the  Chebyshev’s  inequality.  This  implies 


c^D-^^^[Fn{eo)  - D,,]D-^/^c  4 0. 


APPENDIX  G 
PROOF  OF  LEMMA  4.6 


Let  0^  = {f3^,X^)  and  9^  = (/3q, Ag),  then 


c^D-^^^[Fr,{9)-Fn{0o)]D-^^^c 

= f J(^))  - Ao,k(j),  j(w))] 

j=i  -^0 


/ 


By  Taylor’s  expansion, 


\ 


uQU)  ! 


(x^  uQ^(;))D„^/2cdu. 


(G.l) 


[e^"^^h{u-,  A,  k(j),  J(^))  - e^"^^h{u-  Ag,  K{j),3{u))] 

= [xJ(/3-/3g)  +m1^(A-  Ao)]e^j'’'^'/i(u;Aj,K;(j),  J(u)), 

r\i 

where  1 is  a /G-dimensional  vector  of  1 and  (/3j,  AJ)  = cxj{0^,  Ag ) + (1  — 
aj){0^,X^)  for  0 < aj  < 1.  By  Equation  (4.3)  and  Assumption  1, 

{/3  - /3g)  + ul^( A - Ag)  I < (R  + IGu) ^ 

which,  together  with  (F.l),  implies  that  the  absolute  value  of  the  integrand  in  (G.l) 

< (R  + /Gu)4=\/-e^"^“"  — (5"  + ^Gu2)/i(u;A„K(i),J(u)).  (G.2) 

y 7 

By  Lemma  4.2, 

' {B  + /Gu)^^e^ll^°ll^(R2  + IGu^)h{u-  Xj,  k(j),  J(«))du} 
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no 


< n(B  + /G)(52  + /G)e®ll^»ll 


— max  L 

ri\/n7v^  o<m<3 


n 


s/n^y/j 


u;  Xj,  K{j),J{u))du 


= 0(n-'/2). 


Furthermore,  by  the  independence  of  T*s, 


V[c^D-^/\Fr,{0)  - Fr,{9o))D-^^^c\ 

= [e^"^^h{u-,X,K{j),J{u))  - e^^-^h{u-Xo,K{j),J{u))] 

.7=1  -^0 


/ \ 


\uQ{j) 

where  the  integrand  in  the  above  equation  is  the  same  as  in  (G.l).  Hence,  from 
Equation  (G.2)  and  Lemma  4.2, 

V[eD-'/^{F„{0)  - F„{eo))D-'/^ 

j=l 

I \ 


\uQ{j)  j 


(x^  uQ^(i))D„^/2cdu]‘ 


< {B  + IGf{B‘^  + IGf[ 


< [B^IGf[B‘^  + IGf-^nT 


= 0{n-% 


ny^'Yy/j  ^ o< 
A 

n^j 


n pT- 

max  £'[  / u^h{u-,Xj,K{j),J{u))du]‘ 

^ 0<m<3  '~Jq 


Therefore, 

B[c^D-'/"(F„(9„)  - F„(9o))F;''V]" 

= V[c^D-'IHFM  - F,(«o))H„-‘'"c]  + [Ec^C-‘/^(F„(9„)  - F„(9„))£)-*'"c]" 

= 0(n-'). 


Ill 


■ ■ Hence,  the  lemma  is  proved  since  mean  square  convergence  implies  convergence  in 

?**  • 

<- 

probability. 


V 

I 


•.g 


APPENDIX  H 
PROOF  OF  LEMMA  4.7 


From  Lemmas  4.5  and  4.6, 


- D„)D-^'^c  A 0, 


(H.1) 


where  c is  a (p  + 7G)-dimensional  unit  vector.  As  a result  of  Equation  (H.l), 


A 1, 


which  implies  that 


lim  = |im  = 1 


n— >00 


in  probability.  Therefore, 


DZ'I^F(B)D-^I^  4 I(p+,G)x(p+,0). 
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APPENDIX  I 
PROOF  OF  LEMMA  4.8 

It  suffices  to  show  that  for  each  A:,  A:  = 1,  2, . . . ,p  + IG, 

Eeo\\Snjk{9)\\' 

is  uniformly  bounded  for  j = 1, 2, . . . , n,  where  Snjk{0)  is  the  component  of 
the  vector  Snj{9)-  From  Equations  (3.2)  and  (3.3),  we  observe  that  \Snjk{0)\  is  less 
than  or  equal  to  the  maximum  of  the  following  two  quantities; 

R + [ h{u-,Xo,K{j),3{u))du 

' h{u;Xo,K{j),3{u))du. 

Jo 

Note  that  Tj  is  the  survival  time  of  the  individual  and  T*  = min(Tj,  Uj),  where 
Uj  is  the  censoring  time  associated  with  the  individual.  To  show  Eeg\Snjk\ 
is  bounded  uniformly  for  j,  it  suffices  to  show  that  E[Tj]^  is  uniformly  bounded 
since  we  have  shown  the  uniform  boundedness  of  Ee^  Jq  ^ h{u-,  Aq,  K{j),3{u))du  in 
Lemma  4.2.  Assume  that  the  individual  belongs  to  the  treatment  group  g,  then 

roo 

E[Tj]^  = / t^h{u-,Xo,K{j),3{u))e^o^^Snj{t;0o)dt 

Jo 

< r'“A(t;A„,/c(j),J(u))(ii 

Jo 

poo 

+e^ll^oll  / t^h{u-,Xo,K{j),3{u))Snj{t-,do)dt.  (LI) 

Jri-i 

Note  that  the  first  term  in  Equation  (1.1)  is  uniformly  bounded  for  any  j.  The 
second  term  is  the  same  as  in  Equation  (C.4)  where  the  parameters  are  replaced  by 
their  true  value.  Hence,  the  uniform  boundedness  is  proved. 
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APPENDIX  J 
SIMPSON’S  FORMULA 


Suppose  we  want  to  approximate  one  integral  by  dividing  the  integration 
interval  to  Nq  points.  Assume  a is  the  lower  bound  of  the  integral,  b is  the  upper 
bound  of  the  integral,  and  the  integration  form  we  are  interested  is 

[ f{c)dc,  (J.l) 

J a 

then  we  will  divide  the  interval  (a,  b)  into  Nq  intervals  with  the  end  points  of  the 
intervals  represented  as  where  xq  = a,  xatq  = b,  Xj  = a + jh,  and  h = 

The  numerical  approximation  of  (J.l)  is  then  written  as 

, No-l  No-1 

f{c)dc  = -{f{xo)  + f{xNo)  + 2 ^ f{xj)  + 4 ^ 

° j=i  j=o 
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